WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 



PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 
C12N 



A2 



(11) International Publication Number: WO 97/07198 

(43) International Publication Date: 27 February 1997 (27.02.97) 



(21) International Application Number: PCT/US96V 12897 

(22) International Filing Date: 8 August 1996 (08.08.96) 



(30) Priority Date: 

Not furnished 



11 August 1995 (11.08.95) 



US 



(71) Applicant: GENETICS INSTITUTE, INC. [US/US]; 87 Cam- 

bridgePark Drive, Cambridge, MA 02140 (US). 

(72) Inventors: JACOBS, Kenneth; 151 Beaumont Avenue, New- 

ton, MA 02160 (US). MCCOY, John, M.; 63 Pine Ridge 
Road, Reading, MA 01867 (US). KELLEHER, Kerry; 50 
Hurley Circle, Marlborough. MA 01752 (US). CARLIN, 
McKeough; 16 Chauncy Street #22, Cambridge, MA 02138 
(US). 

(74) Agent: BROWN, Scott, A.; Genetics Institute, Inc., Legal 
Affairs, 87 CambridgePark Drive, Cambridge, MA 02140 

(US). 



(81) Designated States: AU, CA, JP, MX, European patent (AT, 
BE, CH f DE, DK. ES. FI. FR. GB, GR. IE, IT, LU, MC, 
NL, PT, SE). 



Published 

Without international search report and to be republished 
upon receipt of that report. 



(54) Title: DNA SEQUENCES AND SECRETED PROTEINS ENCODED THEREBY 
(57) Abstract 

Novel polynucleotides and the proteins encoded thereby are disclosed. 



i 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international 
applications under the PCT. 



AM 


Annenit 


AT 


Austria 


AU 


Australia 


BB 


Barbados 


BE 


Belgium 


BF 


Burkina Faso 


BG 


Bulgaria 


BJ 


Benin 


BR 


Brazil 


BY 


Belarus 


CA 


Canada 


CF 


Centra] African Republic 


CG 


Congo 


CH 


Switzerland 


a 


Cote d'lvoire 


CM 


Cameroon 


CN 


China 


CS 


Czechoslovakia 


CZ 


Czech Republic 


DE 


Germany 


DK 


Denmark 


EE 


Estonia 


ES 


Spam 


n 


Finland 


FR 


France 


GA 


Gabon 



GB 


United Kingdom 


GE 


Georgia 


GN 


Guinea 


GR 


Greece 


HU 


Hungary 


IE 


Ireland 


IT 


Italy 


JP 


Japan 


KE 


Kenya 


KG 


Kyrgyttan 


KP 


Democratic People's Republic 




of Korea 


KR 


Republic of Korea 


KZ 


Kazakhstan 


U 


Liechtenstein 


LK 


Sri Lanka 


LR 


Liberia 


LT 




LU 


Luxembourg 


LV 


Latvia 


MC 


Monaco 


MD 


Republic of Moldova 


MG 


Madagascar 


ML 


Mali 


MN 


Mongolia 


MR 


Mauritania 



MW 


Malawi 


MX 


Mexico 


NE 


Niger 


NL 


Netherlands 


NO 


Norway 


NZ 


New Zealand 


PL 


Poland 


PT 


Portugal 


RO 


Romania 


RU 


Russian Federation 


SD 


Sudan 


SE 


Sweden 


SG 


Singapore 


SI 


Slovenia 


SK 


Slovakia 


SN 


Senegal 


sz 


Swaziland 


TD 


Chad 


TG 


Togo 


TJ 


Tajikistan 


TT 


Trinidad and Tobago 


UA 


Ukraine 


UG 


Uganda 


US 


United States of America 


UZ 


Uzbekistan 


VN 


Viet Nam 



X 

WO 97/07198 



PCT/US96/12897 



DNA SEQUENCES AND SECRETED PROTEINS ENCODED THEREBY 

5 This application claims priority from application Ser. No. 08/514,014, filed on 
August 11, 1995, which was converted to provisional application Ser. No. 
60/ on July 19, 1996. 

FIELD OF THE INVENTION 
1 0 The present invention provides novel polynucleotides and proteins encoded by 

such polynucleotides, along with therapeutic, diagnostic and research utilities for these 
polynucleotides and proteins. 

BACKGROUND OF THE INVENTION 
1 5 Technology aimed at the discovery of protein factors (including e.g., cytokines, 

such as lymphokines, interferons, CSFs and interleukins) has matured rapidly over the 
past decade. The now routine hybridization cloning and expression cloning 
techniques clone novel polynucleotides "directly" in the sense that they rely on 
information directly related to the discovered factor (i.e., partial DNA/amino acid 
20 sequence of the factor in the case of hybridization cloning; activity of the factor in the 
case of expression cloning). More recent "indirect" cloning techniques such as signal 
sequence cloning, which isolates DNA sequences based on the presence of a now 
well-recognized secretory leader sequence motif, as well as various PCR-based or low 
stringency hybridization cloning techniques, have advanced the state of the art by 
25 making available large numbers of DNA/amino acid sequences for factors that are 
known to have biological activity by virtue of their secreted nature in the case of 
leader sequence cloning, or by virtue of the cell or tissue source in the case of PCR- 
based techniques. It is to these factors and the polynucleotides encoding them that the 
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SUMMARY 

In one embodiment, the present invention provides a composition comprising 
an isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ 
ID NO: 1 from nucleotide 38 to nucleotide 1447; 

(b) a polynucleotide comprising a fragment of the nucleotide 
sequence of SEQ ID NO: 1 encoding a protein having biological activity; 

(c) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO: 2; 

(d) a polynucleotide encoding a protein comprising a fragment of 
the amino acid sequence of SEQ ID NO:2 having biological activity; 

(e) a polynucleotide which is an allelic variant of SEQ ID NO: 1 ; 

and 

(f) a polynucleotide capable of hybridizing under stringent 
conditions to any one of the polynucleotides specified in (a)-(e). 

In another embodiment, the present invention provides a composition 
comprising an isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ 
ID NO:3 from nucleotide 52 to nucleotide 2034; 

(b) a polynucleotide comprising a fragment of the nucleotide 
sequence of SEQ ID NO:3 encoding a protein having biological activity; 

(c) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ED NO:4; 

(d) a polynucleotide encoding a protein comprising a fragment of 
the amino acid sequence of SEQ ID NO:4 having biological activity; 

(e) a polynucleotide which is an allelic variant of SEQ ID NO:4; 

and 

(f) a polynucleotide capable of hybridizing under stringent 
conditions to any one of the polynucleotides specified in (a)-(e). 

In another embodiment, the present invention provides a composition 
comprising an isolated polynucleotide selected from the group consisting of: 
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5 (a) a polynucleotide comprising the nucleotide sequence of SEQ 

ID NO:5 from nucleotide 76 to nucleotide 474; 

(b) a polynucleotide comprising a fragment of the nucleotide 
sequence of SEQ ID NO:5 encoding a protein having biological activity; 

(c) a polynucleotide encoding a protein comprising the amino acid 
10 sequence of SEQ ID NO:6; 

(d) a polynucleotide encoding a protein comprising a fragment of 
the amino acid sequence of SEQ ID NO:6 having biological activity; 

(e) a polynucleotide which is an allelic variant of SEQ ID NO:5; 

and 

15 (f) a polynucleotide capable of hybridizing under stringent 

conditions to any one of the polynucleotides specified in (a)-(e). 
In another embodiment, the present invention provides a composition 
comprising an isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ 
20 ID NO: 7 from nucleotide 67 to nucleotide 348; 

(b) a polynucleotide comprising a fragment of the nucleotide 
sequence of SEQ ID NO:7 encoding a protein having biological activity; 

(c) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:8; 

25 (d) a polynucleotide encoding a protein comprising a fragment of 

the amino acid sequence of SEQ ID NO:8 having biological activity; 

(e) a polynucleotide which is an allelic variant of SEQ ID NO:7; 

and 

(f) a polynucleotide capable of hybridizing under stringent 
30 conditions to any one of the polynucleotides specified in (a)-(e). 

In another embodiment, the present invention provides a composition 
comprising an isolated polynucleotide selected from the group consisting of: 

_o it)i a polynucleotide comprising a iiagmciu o: Uil- iiuacoiUR 

sequence of SEQ ID NO:9 encoding a protein having biological activity, 
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5 (c) a polynucleotide encoding a protein comprising the amino acid 

sequence of SEQ ID NO: 10; 

(d) a polynucleotide encoding a protein comprising a fragment of 
the amino acid sequence of SEQ ED NO: 10 having biological activity; 

(e) a polynucleotide which is an allelic variant of SEQ ID NO:9; 

10 and 

(f) a polynucleotide capable of hybridizing under stringent 
conditions to any one of the polynucleotides specified in (a)-(e). 

In another embodiment, the present invention provides a composition 
comprising an isolated polynucleotide selected from the group consisting of: 
15 (a) a polynucleotide comprising the nucleotide sequence of SEQ 

ID NO: 1 1 from nucleotide 86 to nucleotide 544; 

(b) a polynucleotide comprising a fragment of the nucleotide 
sequence of SEQ ID NO: 1 1 encoding a protein having biological activity; 

(c) a polynucleotide encoding a protein comprising the amino acid 
20 sequence of SEQ ID NO: 12; 

(d) a polynucleotide encoding a protein comprising a fragment of 
the amino acid sequence of SEQ ID NO: 12 having biological activity; 

(e) a polynucleotide which is an allelic variant of SEQ ID NO: 1 1 ; 

and 

25 (f) a polynucleotide capable of hybridizing under stringent 

conditions to any one of the polynucleotides specified in (a)-(e). 
In certain preferred embodiments, the polynucleotide is operably linked to an 
expression control sequence. The invention also provides a host cell, including 
bacterial, yeast, insect and mammalian cells, transformed with such polynucleotide 
30 compositions. 

Processes are also provided for producing a protein, which comprise: 

(a) growing a culture of the host cell transformed with such 
polynucleotide compositions in a suitable culture medium; and 

(b) purifying the protein from the culture. 

35 The protein produced according to such methods is also provided by the present 
invention. 
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5 Compositions comprising a protein biological activity are also disclosed. In 

preferred embodiments the protein comprises an amino acid sequence selected from 
the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:2; 

(b) fragments of the amino acid sequence of SEQ ID NO: 2; 
1 0 (c) the amino acid sequence of SEQ ID NO:4; 

(d) fragments of the amino acid sequence of SEQ ID NO:4; 

(e) the amino acid sequence of SEQ ID NO:6; 

(f) fragments of the amino acid sequence of SEQ ID NO:6; 

(g) the amino acid sequence of SEQ ID NO:8; 

15 (h) fragments of the amino acid sequence of SEQ ID NO:S; 

(i) the amino acid sequence of SEQ ID NO: 12; and 
(j) fragments of the amino acid sequence of SEQ ID NO: 12; 
the protein being substantially free from other mammalian proteins. 

Such compositions may further comprise a pharmaceutically acceptable carrier. 
20 Compositions comprising an antibody which specifically reacts with such protein are 
also provided by the present invention. 

Methods are also provided for preventing, treating or ameliorating a medical 
condition which comprises administering to a mammalian subject a therapeutically 
effective amount of a composition comprising a protein of the present invention and 
25 a pharmaceutically acceptable carrier. 



BRIEF DESCRIPTION OF FIGURES 
Fig. 1 is an autoradiograph evidencing the expression of clone J5 in COS cells 
(indicated by arrows). J5 is processed into multiple bands, with the major band at 
30 approximately 58 kD. 

Fig. 2 is an autoradiograph evidencing the expression of clone L105 in COS 
cells (indicated by arrows). 

x rig. * ls an amoradiograpn evidencing tile expresMun o; cionc bio iu COS 

cells (indicated by arrows). 
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DETAILED DESCRIPTION 

ISOLATED PROTEINS AND POLYNUCLEOTIDES 

The sequence of a polynucleotide encoding one protein of the present 
invention is set forth in SEQ ID NO:l, with the coding region extending from 
nucleotides 38 to 1447. This polynucleotide has been identified as "clone J5" The 
amino acid sequence of the protein encoded by clone J5 is set forth in SEQ ID NO:2. 
Clone J5 was deposited with the American Type Culture Collection on August 1 1, 
1995 and given the accession number ATCC 69885. SEQ ID NO:l represents a 
spliced combination of sequence obtained from an isolated clone identified as 
M J5_3_fl", with additional 5* sequence obtained from a second double stranded clone. 
Clone J5 was isolated from a human activated peripheral blood mononuclear cell 
(PBMC) library using a trap which selects for nucleotides encoding secreted proteins; 
therefore, clone J5 does encode a secreted factor. J5 encodes a novel protein; 
BLASTN/BLASTX or FASTA searches revealed no exact sequence matches. 
However, a BLASTX search revealed homology between the J5 protein (in the 
approximate region of amino acids 62-129 of SEQ ID NO:2), epididymal apical 
proteins (including without limitation, epididymal apical protein I-precursor {Macaca 
fascicularis) (accession X66139)) and several snake venom haemorrhagic peptides 
(disintegrins) (including without limitation those assigned accession U0 1235- 1237, 
X68251, and M89784). Analysis of the full-length J5 sequences revealed that the 
disintegrin domain was incomplete and that this clone did not contain an EGF-domain, 
as with some of the other disintegrin family members. J5 does contain a conserved 
metallo-proteinase domain. Based upon these homologies, J5 and these homologous 
proteins are expected to share at least some activities. 

The sequence of a polynucleotide encoding another protein of the present 
invention is set forth in SEQ ID NO:3, with the coding region extending from 
nucleotides 52 to 2034. This polynucleotide has been identified as "clone J422" The 
amino acid sequence of the protein encoded by clone J422 is set forth in SEQ ID 
NO:4. Clone J422 was deposited with the American Type Culture Collection on 
August 11, 1995 and given the accession number ATCC 69884. SEQ ID NO:3 
represents a spliced combination of sequence obtained from an isolated clone 
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5 identified as "J422_fT, with additional 5' sequence obtained from a second double 
stranded clone. Clone J422 was isolated from a human activated peripheral blood 
mononuclear cell (PBMC) library using a trap which selects for nucleotides encoding 
secreted proteins; therefore, clone J422 does encode a secreted factor. J422 encodes 
a novel protein; BLASTN/BLASTX or FASTA searches revealed no exact sequence 

10 matches. However, a FASTA search revealed homology between the J422 protein (in 
the approximate region of amino acids 34-156 of SEQ ID NO:4) and a number of 
Drosophila leucine-rich repeat (LRR) proteins. Analysis of the full-length J422 
sequences revealed that the conserved EGF-domain found in a number of LRR family 
members was not present in J422. Based upon these homologies, J422 and these 

15 homologous proteins arc expected to share at least some activities. 

The sequence of a polynucleotide encoding another protein of the present 
invention is set forth in SEQ ID NO:5, with the coding region extending from 
nucleotides 76 to 474. This polynucleotide has been identified as "clone LI 05" The 
amino acid sequence of the protein encoded by clone LI 05 is set forth in SEQ ID 

20 NO:6. Clone L105 was deposited with the American Type Culture Collection on 
August 1 1, 1995 and given the accession number ATCC 69883. Clone L105 was 
isolated from a murine adult thymus library using a trap which selects for nucleotides 
encoding secreted proteins; therefore, clone LI 05 does encode a secreted factor. LI 05 
encodes a novel protein; BLASTN/BLASTX or FASTA searches revealed no exact 

25 sequence matches. However, a BLASTX search revealed homology between the LI 05 
protein (particularly in the approximate region of amino acids 73-91 of SEQ ID 
NO:6), various monocyte and other chemoattractant proteins (including without 
limitation those assigned accession M577441, X71087, X72308, X14768 and 
M24545) and a chicken (Gallus gallus) cytokine (accession L34553). Based upon 

30 these homologies, L105 and these homologous proteins are expected to share at least 
some activities. 

The sequence of polynucleotides encoding another protein of the present 



polynucleotides nave oeen identified a> uone 11174-10 and clone f 1 174-4 / . 
respectively (collectively referred to herein as "H174"). The amino acid sequence of 
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the protein encoded by clones HI 74 is set forth in SEQ ID NO:8 and SEQ ID NO: 10. 
Clone H174 was deposited with the American Type Culture Collection on August 1 1, 
1995 and given the accession number ATCC 69882. Clones HI 74 were isolated from 
a human activated peripheral blood mononuclear cell (PBMC) library using a trap 
which selects for nucleotides encoding secreted proteins; therefore, HI 74 does encode 
a secreted factor. HI 74 encodes a novel protein; BLASTN/BLASTX or FASTA 
searches revealed no exact sequence matches. However, a BLASTX search revealed 
homology between the H174 protein, human IP-10 (accession M33266) and murine 
CRG-2 (accession M86820) (species homologs). Based upon these homologies, HI 74 
and these homologous proteins are expected to share at least some activities. 

The sequence of a polynucleotide encoding another protein of the present 
invention is set forth in SEQ ID NO: 11, with the coding region extending from 
nucleotides 86 to 544. This polynucleotide has been identified as "B18" The amino 
acid sequence of the protein encoded by clone B18 is set forth in SEQ ID NO: 12. 
Clone B 18 was deposited with the American Type Culture Collection on July 6, 1995 
and assigned accession number ATCC 69868. Clone B 18 was isolated from a human 
activated peripheral blood mononuclear cell (PBMC) library using a trap which selects 
for nucleotides encoding secreted proteins; therefore, clone B18 does encode a 
secreted factor. B 1 8 encodes a novel protein; BLASTN/BLASTX or FASTA searches 
revealed no exact sequence matches. However, a BLASTX search revealed that the 
region from amino acid 29 to amino acid 163 of B 18 (SEQ ID NO: 12) shows marked 
homology to portions of murine CTLA-8 (amino acids 1 8 to 150, accession L13839) 
and herpesvirus SaimiriORFl 3 ("herpes CTLA-8") (amino acids 19to 151, accession 
X64346). Based upon these homologies, B18 is believed to be the human homolog 
of murine and herpes CTLA-8 (i.e., "human CTLA-8"). B 18 may demonstrate 
proinflammatory activity, particularly in development of T-cell dependent immune 
responses. B18 is also expected to possess other activities specified herein. 

Clones J5, L105, H174 and B 18 were each transfected into COS cells labelled 
with 33 S-methionine and protein was expressed. Autoradiographs evidencing 
expression of the proteins in conditioned media are presented in Figs. 1, 2, 3 and 4, 
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5 respectively. The bands of protein expressed from the relevant clone are indicated by 
arrows. 

Polynucleotides hybridizing to the polynucleotides of the present invention 
under stringent conditions and highly stringent conditions are also part of the present 
invention. As used herein, "highly stringent conditions" include, for example, at least 

10 about 0.2xSSC at 65°C; and "stringent conditions" include, for example, at least about 
4xSSC at 65°C or at least about 50% formamide, 4xSSC at 42°C. Allelic variants of 
the polynucleotides of the present invention are also encompassed by the invention. 

Fragments of the proteins of the present invention which are capable of 
exhibiting biological activity are also encompassed by the present invention. 

15 Fragments of the protein may be in linear form or they may be cyclized using known 
methods, for example, as described in H.U. Saragovi, et al* Bio/Technology K), 773- 
778 (1992) and in R.S. McDowell, et al, J. Amer. Chem. Soc. U4, 9245-9253 (1992), 
both of which are incorporated herein by reference. Such fragments may be fused to 
carrier molecules such as immunoglobulins for many purposes, including increasing 

20 the valency of protein binding sites. For example, fragments of the protein may be 
fused through "linker" sequences to the Fc portion of an immunoglobulin. For a 
bivalent form of the protein, such a fusion could be to the Fc portion of an IgG 
molecule. Other immunoglobulin isotypes may also be used to generate such fusions. 
For example, a protein - IgM fusion would generate a decavalent form of the protein 

25 of the invention. 

The isolated polynucleotide of the invention may be operably linked to an 
expression control sequence such as the pMT2 or pED expression vectors disclosed 
in Kaufman et al, Nucleic Acids Res. 19, 4485-4490 (1991), in order to produce the 
protein recombinantly. Many suitable expression control sequences are known in the 

30 art. General methods of expressing recombinant proteins are also known and are 
exemplified in R. Kaufman, Methods in Enzymology 185 , 537-566 (1990). As 
defined herein "operably linked" means that the isolated polynucleotide of the 
— , - i - >vnro^ ; inn -nntrol ^ennence nre ^iniateH within n vector or cell in 
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A number of types of cells may act as suitable host cells for expression of the 
protein. Mammalian host cells include, for example, monkey COS cells, Chinese 
Hamster Ovary (CHO) cells, human kidney 293 cells, human epidermal A43 1 cells, 
human Colo205 cells, 3T3 cells, CV-1 cells, other transformed primate cell lines, 
normal diploid cells, cell strains derived from in vitro culture of primary tissue, 
primary explants, HeLa cells, mouse L cells, BHK, HL-60, U937, HaK or Jurkat cells. 

Alternatively, it may be possible to produce the protein in lower eukaryotes 
such as yeast or in prokaryotes such as bacteria. Potentially suitable yeast strains 
include Saccharomyces cerevisiae, Schizosaccharomyces pombe, Kluyveromyces 
strains, Candida, or any yeast strain capable of expressing heterologous proteins. 
Potentially suitable bacterial strains include Escherichia coli f Bacillus subtilis, 
Salmonella typhimurium, or any bacterial strain capable of expressing heterologous 
proteins. If the protein is made in yeast or bacteria, it may be necessary to modify the 
protein produced therein, for example by phosphorylation or glycosylation of the 
appropriate sites, in order to obtain the functional protein. Such covalent attachments 
may be accomplished using known chemical or enzymatic methods. 

The protein may also be produced by operably linking the isolated 
polynucleotide of the invention to suitable control sequences in one or more insect 
expression vectors, and employing an insect expression system. Materials and 
methods for baculovirus/insect cell expression systems are commercially available in 
kit form from, e.g., Invitrogen, San Diego, California, U.S.A. (the MaxBac® kit), and 
such methods are well known in the art, as described in Summers and Smith, Texas 
Agricultur al Experiment Station Bulletin No. 1555 (1987) . incorporated herein by 
reference. As used herein, an insect cell capable of expressing a polynucleotide of the 
present invention is "transformed." 

The protein of the invention may be prepared by culturing transformed host 
cells under culture conditions suitable to express the recombinant protein. The 
resulting expressed protein may then be purified from such culture (i.e., from culture 
medium or cell extracts) using known purification processes, such as gel filtration and 
ion exchange chromatography. The purification of the protein may also include an 
affinity column containing agents which will bind to the protein; one or more column 
steps over such affinity resins as concanavalin A-agarose, heparin-toyopearl® or 

10 
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5 Cibacrom blue 3GA Sepharose®; one or more steps involving hydrophobic interaction 
chromatography using such resins as phenyl ether, butyl ether, or propyl ether; or 
immunoaffinity chromatography. 

Alternatively, the protein of the invention may also be expressed in a form 
which will facilitate purification. For example, it may be expressed as a fusion 

10 protein, such as those of maltose binding protein (MBP), glutathione-S-transferase 
(GST) or thioredoxin (TRX). Kits for expression and purification of such fusion 
proteins are commercially available from New England BioLab (Beverly, MA), 
Pharmacia (Piscataway, NJ) and InVitrogen, respectively. The protein can also be 
tagged with an epitope and subsequently purified by using a specific antibody directed 

15 to such epitope. One such epitope ("Flag") is commercially available from Kodak 
(New Haven, CT). 

Finally, one or more reverse-phase high performance liquid chromatography 
(RP-HPLC) steps employing hydrophobic RP-HPLC media, e.g., silica gel having 
pendant methyl or other aliphatic groups, can be employed to further purify the 

20 protein. Some or all of the foregoing purification steps, in various combinations, can 
also be employed to provide a substantially homogeneous isolated recombinant 
protein. The protein thus purified is substantially free of other mammalian proteins 
and is defined in accordance with the present invention as an "isolated protein." 

The protein of the invention may also be expressed as a product of transgenic 

25 animals, e.g., as a component of the milk of transgenic cows, goats, pigs, or sheep 
which are characterized by somatic or germ cells containing a nucleotide sequence 
encoding the protein. 

The protein may also be produced by known conventional chemical synthesis. 
Methods for constructing the proteins of the present invention by synthetic means are 

30 known to those skilled in the art. The synthetically-constructed protein sequences, by 
virtue of sharing primary, secondary or tertiary structural and/or conformational 
characteristics with proteins may possess biological properties in common therewith, 

■ I- J— r ■- x'^in 't-MTWy TVjh<: th^v mnv he Tnplovf*d V»inlncrjrnllv nctive or 
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The proteins provided herein also include proteins characterized by amino acid 
sequences similar to those of purified proteins but into which modification are 
naturally provided or deliberately engineered. For example, modifications in the 
peptide or DNA sequences can be made by those skilled in the art using known 
techniques. Modifications of interest in the protein sequences may include the 
replacement, insertion or deletion of a selected amino acid residue in the coding 
sequence. For example, one or more of the cysteine residues may be deleted or 
replaced with another amino acid to alter the conformation of the molecule. 
Mutagenic techniques for such replacement, insertion or deletion are well known to 
those skilled in the art (see, e.g., U.S. Patent No. 4,518,584). 

Other fragments and derivatives of the sequences of proteins which would be 
expected to retain protein activity in whole or in part and may thus be useful for 
screening or other immunological methodologies may also be easily made by those 
skilled in the art given the disclosures herein. Such modifications are believed to be 
encompassed by the present invention. 

USES AND BIOLOGICAL ACTIVITY 

The polynucleotides of the present invention and the proteins encoded thereby 
are expected to exhibit one or more of the uses or biological activities (including those 
associated with assays cited herein) identified below. Uses or activities described for 
proteins of the present invention may be provided by administration or use of such 
proteins or by administration or use of polynucleotides encoding such proteins (such 
as, for example, in gene therapies or vectors suitable for introduction of DNA). 

RESEARCH TOOL UTILITY 

The polynucleotides provided by the present invention can be used by the 
research community for various purposes. The polynucleotides can be used to express 
recombinant protein for analysis, characterization or therapeutic use; as markers for 
tissues in which the corresponding protein is preferentially expressed (either 
constitutively or at a particular stage of tissue differentiation or development or in 
disease states); as molecular weight markers on Southern gels; as chromosome 
markers (when labeled) to map related gene positions; to compare with endogenous 

12 
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5 DNA sequences in patients to identify potential genetic disorders; as probes to 
hybridize and thus discover novel, related DNA sequences: as a source of information 
to derive PCR primers for genetic fingerprinting; as a probe to "subtract-out" known 
sequences in the process of discovering other novel polynucleotides; to raise anti- 
protein antibodies using DNA immunization techniques: and as an antigen to raise 

10 anti-DNA antibodies or elicit another immune response. Where the polynucleotide 
encodes a protein which binds or potentially binds to another protein (such as, for 
example, in a receptor-ligand interaction), the polynucleotide can also be used in 
interaction trap assays (such as, for example, that described in Gyuris et ah, Cell 
75:791-803 (1993)) to identify polynucleotides encoding the other protein with which 

1 5 binding occurs or to identify inhibitors of the binding interaction. 

The proteins provided by the present invention can similarly be used to raise 
antibodies or to elicit another immune response; as a reagent (including the labelled 
reagent) in assays designed to quantitatively determine levels of the protein (or its 
receptor) in biological fluids; as markers for tissues in which the corresponding 

20 protein is preferentially expressed (either constitutively or at a particular stage of 
tissue differentiation or development or in a disease state); and, of course, to isolate 
correlative receptors or ligands. Where the protein binds or potentially binds to 
another protein (such as, for example, in a receptor-ligand interaction), the protein can 
be used to identify the other protein with which binding occurs or to identify inhibitors 

25 of the binding interaction. Proteins involved in these binding interactions can also be 
used to screen for peptide or small molecule inhibitors or agonists of the binding 
interaction. 

Any or all of these "research tool" utilities are capable of being developed into 
reagent grade or kit format for commercialization as "research products." 

30 

CYTOKINE AND CELL PROLIFERATION/DIFFERENTIATION 
ACTIVITY 

■ ':H:pMiju :* *!rr?r::u!:i' !' ■ :!rMi " • - ! . \ :,■■*■ • * v 

activity or may induce production oi other cytokines in certain cell populations. Mam 
protein factors discovered to date, including all known cytokines, have exhibited 
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activity in one or more factor dependent cell proliferation assays, and hence the assays 
serve as a convenient confirmation of cytokine activity. The activity of a protein of 
the present invention is evidenced by any one of a number of routine factor dependent 
cell proliferation assays for cell lines including, without limitation, 32D, DA2, DA1G, 
T10, B9, B9/1 1, BaF3, MC9/G, M+ (preB M+), 2E8, RB5, DAI, 123, Tl 165, HT2, 
CTLL2, TF-1, Mo7e and CMK. 

The activity of a protein of the invention may, among other means, be 
measured by the following methods: 

Assays for T-cell or thymocyte proliferation include without limitation those 
described in: Current Protocols in Immunology, Ed by J. E. Coligan, A.M. Kruisbeek, 
D.H. Margulies, E.M. Shevach, W Strober, Pub. Greene Publishing Associates and 
Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3. 1- 
3.19; Chapter 7, Immunologic studies in Humans); Takai et al., J. Immunol. 
137:3494-3500, 1986; Bertagnolli et al M J. Immunol. 145:1706-1712, 1990; 
Bertagnolli et al., Cellular Immunology 133:327-341, 1991; Bertagnolli, et al., J. 
Immunol. 149:3778-3783, 1992; Bowman et al., J. Immunol. 152: 1756-1761, 1994. 

Assays for cytokine production and/or proliferation of spleen cells, lymph node 
cells or thymocytes include, without limitation, those described in: Polyclonal T cell 
stimulation, Kruisbeek, A.M. and Shevach, E.M. In Current Protocols in 
Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 3.12.1-3.12.14, John Wiley and Sons, 
Toronto. 1994; and Measurement of mouse and human Interferon y, Schreiber, R.D. 
In Current Protocols in Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 6.8.1-6.8.8, John 
Wiley and Sons, Toronto. 1994. 

Assays for proliferation and differentiation of hematopoietic and 
lymphopoietic cells include, without limitation, those described in: Measurement of 
Human and Murine Interleukin 2 and Interleukin 4, Bottomly, K., Davis, L.S. and 
Lipsky, P.E. In Current Protocols in Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 
6.3.1-6.3.12, John Wiley and Sons, Toronto. 1991; deVries et al., J. Exp. Med. 
173: 1205-121 1, 1991; Moreau et al., Nature 336:690-692, 1988; Greenberger et al., 
Proc. Natl. Acad. Sci. U.S.A. 80:2931-2938, 1983; Measurement of mouse and human 
interleukin 6 - Nordan, R. In Current Protocols in Immunology. J.E.e.a. Coligan eds. 
Vol 1 pp. 6.6.1-6.6.5, John Wiley and Sons, Toronto. 1991; Smith et al„ Proc. Natl. 
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5 Acad. Sci. U.S.A. 83:1857-1861, 1986; Measurement of human Interleukin 11 - 
Bennett, F., Giannotti, J., Clark, S.C and Turner, K. J. In Current Protocols in 
Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 6.15.1 John Wiley and Sons, Toronto. 
1991; Measurement of mouse and human Interleukin 9 - Ciarletta, A., Giannotti, J., 
Clark, S.C. and Turner, K.J. In Current Protocols in Immunology. J.E.e.a. Coligan 

10 eds. Vol 1 pp. 6.13.1, John Wiley and Sons, Toronto. 1991. 

Assays for T-cell clone responses to antigens (which will identify, among 
others, proteins that affect APC-T cell interactions as well as direct T-cell effects by 
measuring proliferation and cytokine production) include, without limitation, those 
described in; Current Protocols in Immunology, Ed by J. E. Coligan, A.M. Kruisbeek, 

1 5 D.I I. Margulies, E.M. Shevach, W Strobcr 

Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, In Vitro assays 
for Mouse Lymphocyte Function; Chapter 6, Cytokines and their cellular receptors; 
Chapter 7, Immunologic studies in Humans); Weinberger et al., Proc. Natl. Acad. Sci. 
USA 77:6091-6095, 1980; Weinberger etaL, Eur. J. Immun. 11:405-411, 1981;Takai 

20 et al., J. Immunol. 137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 
1988. 

IMMUNE STIMULATING/SUPPRESSING ACTIVITY 

A protein of the present invention may also exhibit immune stimulating or 
25 immune suppressing activity, including without limitation the activities for which 
assays are described herein. A protein may be useful in the treatment of various 
immune deficiencies and disorders (including severe combined immunodeficiency 
(SOD)), e.g., in regulating (up or down) growth and proliferation of T and/or B 
lymphocytes, as well as effecting the cytolytic activity of NK cells and other cell 
30 populations. These immune deficiencies may be genetic or be caused by viral (e.g., 
HIV) as well as bacterial or fungal infections, or may result from autoimmune 
disorders. More specifically, infectious diseases causes by viral, bacterial, fungal or 
■ ■ *' * : ■• v>,> fr^tnMn "c-inn n nrntrin of thf pr^s^nt invention inHuriinr 
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the present invention may also be useful where a boost to the immune system 
generally would be indicated, i.e., in the treatment of cancer. 

Autoimmune disorders which may be treated using a protein of the present 
invention include, for example, multiple sclerosis, systemic lupus erythematosus, 
rheumatoid arthritis, autoimmune pulmonary inflammation, Guillain-Barre syndrome, 
autoimmune thyroiditis, insulin dependent diabetes mellitis, myasthenia gravis, graft- 
versus-host disease and autoimmune inflammatory eye disease. Such a protein of the 
present invention may also to be useful in the treatment of allergic reactions and 
conditions, such as asthma or other respiratory problems. Other conditions, in which 
immune suppression is desired (including, for example, asthma and related respriatory 
conditions), may also be treatable using a protein of the present invention. 

A protein of the present invention may also suppress chronic or acute 
inflammation, such as, for example, that associated with infection (such as septic 
shock or systemic inflammatory response syndrome (SIRS)), inflammatory bowel 
disease, Crohn's disease or resulting from over production of cytokines such as TNF 
or IL-1 (such as the effect demonstrated by IL-1 1). 

The activity of a protein of the invention may, among other means, be 
measured by the following methods: 

Suitable assays for thymocyte or splenocyte cytotoxicity include, without 
limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, 
A.M. Kruisbeek, D.H. Margulies, E.M. Shevach, W Strober, Pub. Greene Publishing 
Associates and Wiley-Literscience (Chapter 3, In Vitro assays for Mouse Lymphocyte 
Function 3.1-3.19; Chapter 7, Immunologic studies in Humans); Herrmann et al., 
Proc. Natl. Acad. Sci. USA 78:2488-2492, 1981; Herrmann et al., J. Immunol. 
128:1968-1974, 1982; Handa et al., J. Immunol. 135:1564-1572, 1985; Takaiet al., 
J. Immunol. 137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 1988; 
Herrmann et al., Proc. Natl. Acad. Sci. USA 78:2488-2492, 1981; Herrmann et al., J. 
Immunol. 128:1968-1974, 1982; Handa et al., J. Immunol. 135:1564-1572, 1985; 
Takai et al., J. Immunol. 137:3494-3500, 1986; Bowmanet al., J. Virology 61:1992- 
1998; Takai et al., J. Immunol. 140:508-512, 1988; Bertagnolli et al., Cellular 
Immunology 133:327-341, 1991; Brown etal., J. Immunol. 153:3079-3092, 1994. 

16 
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5 Assays for T-cell-dependent immunoglobulin responses and isotype switching 

(which will identify, among others, proteins that modulate T-cell dependent antibody 
responses and that affect Thl/Th2 profiles) include, without limitation, those 
described in: Maliszewski, J. Immunol 144:3028-3033, 1990; and Assays for B cell 
function: In vitro antibody production, Mond, J.J. and Brunswick, M. In Current 

1 0 Protocols in Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 3.8. 1-3.8. 16, John Wiley and 
Sons, Toronto. 1994. 

Mixed lymphocyte reaction (MLR) assays (which will identify, among others, 
proteins that generate predominantly Thl and CTL responses) include, without 
limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, 

15 A.M. Kruisbeek, D.II. Margulies, E.M. Shevach, W Strober, Pub. Greene Publishing 
Associates and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte 
Function 3.1-3.19; Chapter 7, Immunologic studies in Humans); Takai et al., J. 
Immunol. 137:3494-3500, 1986; Takai et al.; J. Immunol. 140:508-512, 1988; 
Bertagnolli et al., J. Immunol. 149:3778-3783, 1992. 

20 Dendritic cell-dependent assays (which will identify, among others, proteins 

expressed by denritic cells that activate naive T-cells) include, without limitation, 
those described in: Guery et al., J. Immunol. 134:536-544, 1995; Inaba et al., Journal 
of Experimenal Medicine 173:549-559, 1991; Macatonia et al., Journal of 
Immunology 154:5071-5079, 1995; Porgadoret al., Journal of Experimental Medicine 

25 182:255-260, 1995; Nair et al., Journal of Virology 67:4062-4069, 1993; Huang et 
al., Science 264:961-965, 1994; Macatonia et al., Journal of Experimental Medicine 
169:1255-1264, 1989; Bhardwaj et al., Journal of Clinical Investigation 94:797-807, 
1994; and Inaba et ah, Journal of Experimental Medicine 172:631-640, 1990. 

Assays for lymphocyte survival/apoptosis (which will identify, among others, 

30 proteins that prevent apoptosis after superantigen induction and proteins that regulate 
lymphocyte homeostasis) include, without limitation, those described in: 
Darzynkiewicz et al., Cytometry 13:795-808, 1992; Gorczyca et al., Leukemia 

- /-en r~<r\ on- r ,,t -J r-,npor P^c^rrb <n-104* 105 1 1001- Ttnb rt nl 
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Assays for proteins that influence early steps of T-cell commitment and 
development include, without limitation, those described in: Antica et aL, Blood 
84:111-117, 1994; Fine etal., Cellular Immunology 155:111-122, 1994; Galy et al., 
Blood 85:2770-2778, 1995; Toki et al., Proc. Nat. Acad Sci. USA 88:7548-7551, 
1991. 

HEMATOPOIESIS REGULATING ACTIVITY 

A protein of the present invention may be useful in regulation of hematopoiesis 
and, consequently, in the treatment of myeloid or lymphoid cell deficiencies. Even 
marginal biological activity in support of colony forming cells or of factor-dependent 
cell lines indicates involvement in regulating hematopoiesis. e.g. in supporting the 
growth and proliferation of erythroid progenitor cells alone or in combination with 
other cytokines, thereby indicating utility, for example, in treating various anemias or 
for use in conjunction with irradiation/chemotherapy to stimulate the production of 
erythroid precursors and/or erythroid cells; in supporting the growth and proliferation 
of myeloid cells such as granulocytes and monocytes/macrophages (i.e., traditional 
CSF activity) useful, for example, in conjunction with chemotherapy to prevent or 
treat consequent myelo-suppression; in supporting the growth and proliferation of 
megakaryocytes and consequently of platelets thereby allowing prevention or 
treatment of various platelet disorders such as thrombocytopenia, and generally for use 
in place of or complimentarily to platelet transfusions; and/or in supporting the growth 
and proliferation of hematopoietic stem cells which are capable of maturing to any and 
all of the above-mentioned hematopoietic cells and therefore find therapeutic utility 
in various stem cell disorders (such as those usually treated with transplantation, 
including, without limitation, aplastic anemia and paroxysmal nocturnal 
hemoglobinuria), as well as in repopulating the stem cell compartment post 
irradiation/chemotherapy, either in-vivo or ex-vivo (i.e. in conjunction with bone 
marrow transplantation) as normal cells or genetically manipulated for gene therapy. 

The activity of a protein of the invention may, among other means, be 
measured by the following methods: 

Suitable assays for proliferation and differentiation of various hematopoietic 
lines are cited above. 



18 



WO 97/07198 



PCT/US96/12897 



5 Assays for embryonic stem cell differentiation (which will identify, among 

others, proteins that influence embyronic differentation hematopoiesis) include, 
without limitation, those described in: Johansson et al. Cellular Biology 15:141-151, 
1995; Keller et al., Molecular and Cellular Biology 13:473-486, 1993; McClanahan 
et al., Blood 81:2903-2915, 1993. 

10 Assays for stem cell survival and differentiation (which will identify, among 

others, proteins that regulate lympho-hematopoiesis) include, without limitation, those 
described in: Methylcellulose colony forming assays, Freshney, M.G. In Culture of 
Hematopoietic Cells. R.L Freshney, et al eds. Vol pp. 265-268, Wiley-Liss, Inc., 
New York, NY. 1994; Hirayama et al., Proc. Natl. Acad. Sci. USA 89:5907-591 1, 

15 1992; Primitive hematopoietic colony forming cells with high proliferative potential, 
McNiece, I.K. and Briddell, R. A. In Culture of Hematopoietic Cells. R.I. Freshney, 
et al eds. Vol pp. 23-39, Wiley-Liss, Inc., New York, NY. 1994; Neben et al., 
Experimental Hematology 22:353-359, 1994; Cobblestone area forming cell assay, 
Ploemacher, R.E. In Culture of Hematopoietic Cells. R.L Freshney, et al. eds. Vol pp. 

20 1-21, Wiley-Liss, Inc., New York, NY. 1994; Long term bone marrow cultures in the 
presence of stromal cells, Spooncer, E., Dexter, M. and Allen, T. In Culture of 
Hematopoietic Cells. R.L Freshney, et al. eds. Vol pp. 163-179, Wiley-Liss, Inc., 
New York, NY. 1994; Long term culture initating cell assay, Sutherland, H.J. In 
Culture of Hematopoietic Cells. R.L Freshney, et al eds. Vol pp. 139-162, 

25 Wiley-Liss, Inc., New York, NY. 1994. 

TISSUE GENERATION/REGENERATION ACTIVITY 

A protein of the present invention also may have utility in compositions used 
for bone, cartilage, tendon, ligament and/or nerve tissue growth or regeneration, as 
30 well as for wound healing and tissue repair, and in the treatment of burns, incisions 
and ulcers. 

A protein of the present invention, which induces cartilage and/or bone growth 

:Ki-r run.- -^v " '; t ikk : ■■ 
i> preparation employing a protein ol the invention may have piophyiactiL u>>c in cioscc 
as well as open fracture reduction and also in the improved fixation of artificial joints. 
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5 De novo bone formation induced by an osteogenic agent contributes to the repair of 
congenital, trauma induced, or oncologic resection induced craniofacial defects, and 
also is useful in cosmetic plastic surgery. 

A protein of this invention may also be used in the treatment of periodontal 
disease, and in other tooth repair processes. Such agents may provide an environment 
10 to attract bone-forming cells, stimulate growth of bone-forming cells or induce 
differentiation of progenitors of bone-forming cells. A protein of the invention may 
also be useful in the treatment of osteoporosis or osteoarthritis, such as through 
stimulation of bone and/or cartilage repair or by blocking inflammation or processes 
of tissue destruction (collagenase activity, osteoclast activity, etc.) mediated by 
15 inflammatory processes. 

Another category of tissue regeneration activity that may be attributable to the 
protein of the present invention is tendon/ligament formation. A protein of the present 
invention, which induces tendon/ligament-like tissue or other tissue formation in 
circumstances where such tissue is not normally formed, has application in the healing 
20 of tendon or ligament tears, deformities and other tendon or ligament defects in 
humans and other animals. Such a preparation employing a tendon/ligament-like 
tissue inducing protein may have prophylactic use in preventing damage to tendon or 
ligament tissue, as well as use in the improved fixation of tendon or ligament to bone 
or other tissues, and in repairing defects to tendon or ligament tissue. De novo 
25 tendon/ligament-like tissue formation induced by a composition of the present 
invention contributes to the repair of congenital, trauma induced, or other tendon or 
ligament defects of other origin, and is also useful in cosmetic plastic surgery for 
attachment or repair of tendons or ligaments. The compositions of the present 
invention may provide an environment to attract tendon- or ligament-forming cells, 
30 stimulate growth of tendon- or ligament-forming cells, induce differentiation of 
progenitors of tendon- or ligament-forming cells, or induce growth of tendon/ligament 
cells or progenitors ex vivo for return in vivo to effect tissue repair. The compositions 
of the invention may also be useful in the treatment of tendinitis, carpal tunnel 
syndrome and other tendon or ligament defects. The compositions may also include 
35 an appropriate matrix and/or sequestering agent as a carrier as is well known in the art. 
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5 The protein of the present invention may also be useful for proliferation of 

neural cells and for regeneration of nerve and brain tissue, i.e. for the treatment of 
central and peripheral nervous system diseases and neuropathies, as well as 
mechanical and traumatic disorders, which involve degeneration, death or trauma to 
neural cells or nerve tissue. More specifically, a protein may be used in the treatment 
10 of diseases of the peripheral nervous system, such as peripheral nerve injuries, 
peripheral neuropathy and localized neuropathies, and central nervous system 
diseases, such as Alzheimer's, Parkinson's disease, Huntington's disease, amyotrophic 
lateral sclerosis, and Shy-Drager syndrome. Further conditions which may be treated 
in accordance with the present invention include mechanical and traumatic disorders, 
15 such as spinal cord disorders, head trauma and cerebrovascular diseases such as 
stroke. Peripheral neuropathies resulting from chemotherapy or other medical 
therapies may also be treatable using a protein of the invention. 

It is expected that a protein of the present invention may also exhibit activity 
for generation of other tissues, such as organs (including, for example, pancreas, liver, 
20 intestine, kidney, skin, endothelium), muscle (smooth, skeletal or cardiac) and 
vascular (including vascular endothelium) tissue, or for promoting the growth of cells 
comprising such tissues. Part of the desired effects may be by inhibition of fibrotic 
scarring to allow normal tissue to regenerate. 

A protein of the present invention may also be useful for gut protection or 
25 regeneration and treatment of lung or liver fibrosis, reperfusion injury in various 
tissues, and conditions resulting from systemic cytokine damage. 

The activity of a protein of the invention may, among other means, be 
measured by the following methods: 

Assays for tissue generation activity include, without limitation, those 
30 described in: International Patent Publication No. WO95/16035 (bone, cartilage, 
tendon); International Patent Publication No. WO95/05846 (nerve, neuronal); 
International Patent Publication No. W09 1/07491 (skin, endothelium ). 
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o A protein oi the presem invention niav <ii.su exhbu acuvin- oi innibin-rciatLv 

activities, lnhibins are characterized by their ability to inhibit the release of follicle 



97/07198 



PCT/US96/12897 



stimulating hormone (FSH), while activins and are characterized by their ability to 
stimulate the release of follicle stimulating hormone (FSH). Thus, a protein of the 
present invention, alone or in heterodimers with a member of the inhibin a family, 
may be useful as a contraceptive based on the ability of inhibins to decrease fertility 
in female mammals and decrease spermatogenesis in male mammals. Administration 
of sufficient amounts of other inhibins can induce infertility in these mammals. 
Alternatively, the protein of the invention, as a homodimer or as a heterodimer with 
other protein subunits of the inhibin-P group, may be useful as a fertility inducing 
therapeutic, based upon the ability of activin molecules in stimulating FSH release 
from cells of the anterior pituitary. See, for example, United States Patent 4,798,885. 
A protein of the invention may also be useful for advancement of the onset of fertility 
in sexually immature mammals, so as to increase the lifetime reproductive 
performance of domestic animals such as cows, sheep and pigs. 

The activity of a protein of the invention may, among other means, be 
measured by the following methods: 

Assays for activin/inhibin activity include, without limitation, those described 
in: Vale et aL, Endocrinology 91:562-572, 1972; Ling et al., Nature 321:779-782, 
1986; Vale etal., Nature 321:776-779, 1986; Mason etal, Nature 318:659-663, 1985; 
Forage et al., Proc. Natl. Acad. Sci. USA 83:3091-3095, 1986. 

CHEMOTACnC/CHEMOKINETIC ACTIVITY 

A protein of the present invention may have chemotactic or chemokinetic 
activity (e.g., act as a chemokine) for mammalian cells, including, for example, 
monocytes, neutrophils, T-cells, mast cells, eosinophils and/or endothelial cells. 
Chemotactic and chemokinetic proteins can be used to mobilized or attract a desired 
cell population to a desired site of action. Chemotactic or chemokinetic proteins 
provide particular advantages in treatment of wounds and other trauma to tissues, as 
well as in treatment of localized infections. For example, attraction of lymphocytes, 
monocytes or neutrophils to tumors or sites of infection may result in improved 
immune responses against the tumor or infecting agent. 

A protein or peptide has chemotactic activity for a particular cell population 
if it can stimulate, directly or indirecdy, the directed orientation or movement of such 
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5 cell population. Preferably, the protein or peptide has the ability to directly stimulate 
directed movement of cells. Whether a particular protein has chemotactic activity for 
a population of cells can be readily determined by employing such protein or peptide 
in any known assay for cell chemotaxis. 

The activity of a protein of the invention may, among other means, be 
10 measured by the following methods: 

Assays for chemotactic activity (which will identify proteins that induce or 
prevent chemotaxis) consist of assays that measure the ability of a protein to induce 
the migration of cells across a membrane as well as the ability of a protein to induce 
the adhesion of one cell population to another cell population. Suitable assays for 
i5 movement and adhesion include, without limitation, those described in: Current 
Protocols in Immunology, Ed by IE. Coligan, A.M. Kruisbeek, D.H. Margulies, E.M. 
Shevach, W.Strober, Pub. Greene Publishing Associates and Wiley-Intersciece 
(Chapter 6.12, Measurement of alpha and beta Chemokines 6.12.1-6.12.28; Taub et 
al. J. Clin. Invest. 95:1370-1376, 1995; Lind et al. APMIS 103:140-146, 1995; Muller 
20 et al Eur. J. Immunol. 25: 1744-1748; Gruber et al. J. of Immunol. 152:5860-5867, 
1994; Johnston et al. J. of Immunol. 153: 1762-1768, 1994. 

HEMOSTATIC AND THROMBOLYTIC ACTIVITY 

A protein of the invention may also exhibit hemostatic or thrombolytic 
25 activity. As a result, such a protein is expected to be useful in treatment of various 
coagulation disorders (including hereditary disorders, such as hemophilias) or to 
enhance coagulation and other hemostatic events in treating wounds resulting from 
trauma, surgery or other causes. A protein of the invention may also be useful for 
dissolving or inhibiting formation of thromboses and for treatment and prevention of 
30 conditions resulting therefrom (such as, for example, infarction or stroke). 

The activity of a protein of the invention may, among other means, be 
measured by the following methods: 

I'hromDosis Res. 4^:413-419, i L Jb/: Humphrey a ai., hibnnoiysi> o. V/v » ivvi 
Schaub, Prostaglandins 35:467-474, 1988. 
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RECEPTOR/UGAND ACTTVTTY 

A protein of the present invention may also demonstrate activity as receptors, 
receptor ligands or inhibitors or agonists of receptor/ligand interactions. Examples of 
such receptors and ligands include, without limitation, cytokine receptors and their 
ligands, receptor kinases and their ligands, receptor phosphatases and their ligands, 
receptors involved in cell-cell interactions and their ligands (including without 
limitation, cellular adhesion molecules (such as selectins, integrins and their ligands) 
and receptor/ligand pairs involved in antigen presentation, antigen recognition and 
development of cellular and humoral immune responses). Receptors and ligands are 
also useful for screening of potential peptide or small molecule inhibitors of the 
relevant receptor/ligand interaction. A protein of the present invention (including, 
without limitation, fragments of receptors and ligands) may themselves be useful as 
inhibitors of receptor/ligand interactions. 

The activity of a protein of the invention may, among other means, be 
measured by the following methods: 

Suitable assays for receptor-ligand activity include without limitation those 
described in:Current Protocols in Immunology, Ed by J.E. Coligan, A.M. Kruisbeek, 
D.H. Margulies, E.M. Shevach, W.Strober, Pub. Greene Publishing Associates and 
Wiley-Intersciece (Chapter 7.28, Measurement of Cellular Adhesion under static 
conditions 7.28.1-7.28.22), Takai et al., Proc. Natl. Acad. Sci. USA 84:6864-6868, 
1987; Biereretal., J. Exp. Med. 168:1145-1156, 1988; Rosenstein et al., J.Exp. 
Med. 169:149-160 1989; Stoltenborg et al., J. Immunol. Methods 175:59-68, 1994; 
Stitt et al., Cell 80:661-670, 1995. 

OTHER ACTTVITTES 

A protein of the invention may also exhibit one or more of the following 
additional activities or effects: killing infectious agents, including, without limitation, 
bacteria, viruses, fungi and other parasites; effecting (suppressing or enhancing) bodily 
characteristics, including, without limitation, height, weight, hair color, eye color, skin 
or other tissue pigmentation, or organ size (such as, for example, breast augmentation 
or diminution); effecting the processing of dietary fat, protein or carbohydrate; 
effecting behavioral characteristics, including, without limitation, appetite, libido, 
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5 stress, cognition (including cognitive disorders), depression (including depressive 
disorders) and violent behaviors; providing analgesic effects or other pain reducing 
effects; promoting differentiation and growth of ebryonic stem cells in lineages other 
than hematopoietic lineages; and in the case of enzymes, correcting deficiencies of the 
enzyme and treating related diseases. 

10 

ADMINISTRATION AND DOSING 

A protein of the present invention (from whatever source derived, including 
without limitation from recombinant and non-recombinant sources) may be used in 
a pharmaceutical composition when combined with a pharmaceutically acceptable 

15 carrier. Such a composition may also contain (in addition to protein and a carrier) 
diluents, fillers, salts, buffers, stabilizers, solubilizers, and other materials well known 
in the art. The term "pharmaceutically acceptable" means a non-toxic material that 
does not interfere with the effectiveness of the biological activity of the active 
ingredient(s). The characteristics of the carrier will depend on the route of 

20 administration. The pharmaceutical composition of the invention may also contain 
cytokines, lymphokines, or other hematopoietic factors such as M-CSF, GM-CSF, 
TNF, IL-1, IL-2, IL-3, 11^4, IL-5, H^6, IL-7, IL-8, BL-9, IL-10, IL-1 1, IL-12, IL-13, DL- 
14, IL-15, IFN, TNFO, TNF1, TNF2, G-CSF, Meg-CSF, thrombopoietin, stem cell 
factor, and erythropoietin. The pharmaceutical composition may further contain other 

25 agents which either enhance the activity of the protein or compliment its activity or 
use in treatment. Such additional factors and/or agents may be included in the 
pharmaceutical composition to produce a synergistic effect with protein of the 
invention, or to minimize side effects. Conversely, protein of the present invention 
may be included in formulations of the particular cytokine, lymphokine, other 

30 hematopoietic factor, thrombolytic or anti-thrombotic factor, or anti-inflammatory 
agent to minimize side effects of the cytokine, lymphokine, other hematopoietic factor, 
thrombolytic or anti-thrombotic factor, or anti-inflammatory agent. 



pharmaceutical compositions oi tne invention may comprise a protein oi the inventioii 
in such multimenc or complexed form. 
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The pharmaceutical composition of the invention may be in the form of a 
complex of the protein(s) of present invention along with protein or peptide antigens. 
The protein and/or peptide antigen will deliver a stimulatory signal to both B and T 
lymphocytes. B lymphocytes will respond to antigen through their surface 
immunoglobulin receptor. T lymphocytes will respond to antigen through the T cell 
receptor (TCR) following presentation of the antigen by MHC proteins. MHC and 
structurally related proteins including those encoded by class I and class H MHC genes 
on host cells will serve to present the peptide antigen(s) to T lymphocytes. The 
antigen components could also be supplied as purified MHC-peptide complexes alone 
or with co-stimulatory molecules that can directly signal T cells. Alternatively 
antibodies able to bind surface immunolgobulin and other molecules on B cells as well 
as antibodies able to bind the TCR and other molecules on T cells can be combined 
with the pharmaceutical compositon of the invention. 

The pharmaceutical composition of the invention may be in the form of a 
liposome in which protein of the present invention is combined, in addition to other 
pharmaceutically acceptable carriers, with amphipathic agents such as lipids which 
exist in aggregated form as micelles, insoluble monolayers, liquid crystals, or lamellar 
layers in aqueous solution. Suitable lipids for liposomal formulation include, without 
limitation, monoglycerides, diglycerides, sulfatides, lysolecithin, phospholipids, 
saponin, bile acids, and the like. Preparation of such liposomal formulations is within 
the level of skill in the art, as disclosed, for example, in U.S. Patent No. 4,235,871; 
U.S. Patent No. 4,501,728; U.S. Patent No. 4,837,028; and U.S. Patent No. 4,737,323, 
all of which are incorporated herein by reference. 

As used herein, the term "therapeutically effective amount" means the total 
amount of each active component of the pharmaceutical composition or method that 
is sufficient to show a meaningful patient benefit, i.e., treatment, healing, prevention 
or amelioration of the relevant medical condition, or an increase in rate of treatment, 
healing, prevention or amelioration of such conditions. When applied to an individual 
active ingredient, administered alone, the term refers to that ingredient alone. When 
applied to a combination, the term refers to combined amounts of the active 
ingredients that result in the therapeutic effect, whether administered in combination, 
serially or simultaneously. 
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5 In practicing the method of treatment or use of the present invention, a 

therapeutically effective amount of protein of the present invention is administered to 
a mammal having a condition to be treated. Protein of the present invention may be 
administered in accordance with the method of the invention either alone or in 
combination with other therapies such as treatments employing cytokines, 

10 lymphokines or other hematopoietic factors. When co-administered with one or more 
cytokines, lymphokines or other hematopoietic factors, protein of the present 
invention may be administered either simultaneously with the cytokine(s), 
lymphokine(s), other hematopoietic factors), thrombolytic or anti-thrombotic factors, 
or sequentially. If administered sequentially, the attending physician will decide on 

15 the appropriate sequence of administering protein of the present invention in 
combination with cytokine(s), lymphokine(s), other hematopoietic factor(s), 
thrombolytic or anti-thrombotic factors. 

Administration of protein of the present invention used in the pharmaceutical 
composition or to practice the method of the present invention can be carried out in 

20 a variety of conventional ways, such as oral ingestion, inhalation, or cutaneous, 
subcutaneous, or intravenous injection. Intravenous administration to the patient is 
preferred. 

When a therapeutically effective amount of protein of the present invention is 
administered orally, protein of the present invention will be in the form of a tablet, 

25 capsule, powder, solution or elixir. When administered in tablet form, the 
pharmaceutical composition of the invention may additionally contain a solid carrier 
such as a gelatin or an adjuvant. The tablet, capsule, and powder contain from about 
5 to 95% protein of the present invention, and preferably from about 25 to 90% 
protein of the present invention. When administered in liquid form, a liquid carrier 

30 such as water, petroleum, oils of animal or plant origin such as peanut oil, mineral oil, 
soybean oil, or sesame oil, or synthetic oils may be added. The liquid form of the 
pharmaceutical composition may further contain physiological saline solution, 

^oiiipoMiiun conUiins hum uboui LO to 90% by weight ol protein oi U\c prcscrn 
invention, and preferably from about 1 to 50% protein of the present invention. 
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When a therapeutically effective amount of protein of the present invention is 
administered by intravenous, cutaneous or subcutaneous injection, protein of the 
present invention will be in the form of a pyrogen-free, parenterally acceptable 
aqueous solution. The preparation of such parenterally acceptable protein solutions, 
having due regard to pH, isotonicity, stability, and the like, is within the skill in the art. 
A preferred pharmaceutical composition for intravenous, cutaneous, or subcutaneous 
injection should contain, in addition to protein of the present invention, an isotonic 
vehicle such as Sodium Chloride Injection, Ringer's Injection, Dextrose Injection, 
Dextrose and Sodium Chloride Injection, Lactated Ringer's Injection, or other vehicle 
as known in the art. The pharmaceutical composition of the present invention may 
also contain stabilizers, preservatives, buffers, antioxidants, or other additives known 
to those of skill in the art. 

The amount of protein of the present invention in the pharmaceutical 
composition of the present invention will depend upon the nature and severity of the 
condition being treated, and on the nature of prior treatments which the patient has 
undergone. Ultimately, the attending physician will decide the amount of protein of 
the present invention with which to treat each individual patient. Initially, the 
attending physician will administer low doses of protein of the present invention and 
observe the patient's response. Larger doses of protein of the present invention may 
be administered until the optimal therapeutic effect is obtained for the patient, and at 
that point the dosage is not increased further. It is contemplated that the various 
pharmaceutical compositions used to practice the method of the present invention 
should contain about 0.01 pg to about 100 mg (preferably about 0. lug to about 10 mg, 
more preferably about 0.1 pg to about 1 mg) of protein of the present invention per kg 
body weight. 

The duration of intravenous therapy using the pharmaceutical composition of 
the present invention will vary, depending on the severity of the disease being treated 
and the condition and potential idiosyncratic response of each individual patient. It 
is contemplated that the duration of each application of the protein of the present 
invention will be in the range of 12 to 24 hours of continuous intravenous 
administration. Ultimately the attending physician will decide on the appropriate 
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5 duration of intravenous therapy using the pharmaceutical composition of the present 
invention. 

Protein of the invention may also be used to immunize animals to obtain 
polyclonal and monoclonal antibodies which specifically react with the protein. Such 
antibodies may be obtained using either the entire protein or fragments thereof as an 

10 immunogen. The peptide immunogens additionally may contain a cysteine residue at 
the carboxyl terminus, and are conjugated to a hapten such as keyhole limpet 
hemocyanin (KLH). Methods for synthesizing such peptides are known in the art, for 
example, as in R.P. Merrifield, J. Amer.Chem.Soc. 85, 2149-2154 (1963); J.L. 
Krstenansky, et ai 3 FEBS Lett. 2H, 10 (1987). Monoclonal antibodies binding to the 

i 5 protein of the invention may be useful diagnostic agents for the immunodetection of 
the protein. Neutralizing monoclonal antibodies binding to the protein may also be 
useful therapeutics for both conditions associated with the protein and also in the 
treatment of some forms of cancer where abnormal expression of the protein is 
involved. In the case of cancerous cells or leukemic cells, neutralizing monoclonal 

20 antibodies against the protein may be useful in detecting and preventing the metastatic 
spread of the cancerous cells, which may be mediated by the protein. 

For compositions of the present invention which are useful for bone, cartilage, 
tendon or ligament regeneration, the therapeutic method includes administering the 
composition topically, systematically, or locally as an implant or device. When 

25 administered, the therapeutic composition for use in this invention is, of course, in a 
pyrogen-free, physiologically acceptable form. Further, the composition may 
desirably be encapsulated or injected in a viscous form for delivery to the site of bone, 
cartilage or tissue damage. Topical administration may be suitable for wound healing 
and tissue repair. Therapeutically useful agents other than a protein of the invention 

30 which may also optionally be included in the composition as described above, may 
alternatively or additionally, be administered simultaneously or sequentially with the 
composition in the methods of the invention. Preferably for bone and/or cartilage 

^tructuic lor the Developing Done anu cartilage ana upurnaiiv eapaDiL k>: being 
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resorbed into the body. Such matrices may be formed of materials presently in use for 
other implanted medical applications. 

The choice of matrix material is based on biocompatibility, biodegradability, 
mechanical properties, cosmetic appearance and interface properties. The particular 
application of the compositions will define the appropriate formulation. Potential 
matrices for the compositions may be biodegradable and chemically defined calcium 
sulfate, tricalciumphosphate, hydroxyapatite, polylactic acid, polyglycolic acid and 
polyanhydrides. Other potential materials are biodegradable and biologically well- 
defined, such as bone or dermal collagen. Further matrices are comprised of pure 
proteins or extracellular matrix components. Other potential matrices are 
nonbiodegradable and chemically defined, such as sintered hydroxapatite, bioglass, 
aluminates, or other ceramics. Matrices may be comprised of combinations of any of 
the above mentioned types of material, such as polylactic acid and hydroxyapatite or 
collagen and tricalciumphosphate. The bioceramics may be altered in composition, 
such as in calcium-aluminate-phosphate and processing to alter pore size, particle size, 
particle shape, and biodegradability. 

Presently preferred is a 50:50 (mole weight) copolymer of lactic acid and 
glycolic acid in the form of porous particles having diameters ranging from 150 to 800 
microns. In some applications, it will be useful to utilize a sequestering agent, such 
as carboxymethyl cellulose or autologous blood clot, to prevent the protein 
compositions from disassociating from the matrix. 

A preferred family of sequestering agents is cellulosic materials such as 
alkylcelluloses (including hydroxyalkylcelluloses), including methylcellulose, 
ethylcellulose, hydroxyethylcellulose, hydroxypropylcellulose, hydroxypropyl- 
methylcellulose, and carboxymethylcellulose, the most preferred being cationic salts 
of carboxymethylcellulose (CMC). Other preferred sequestering agents include 
hyaluronic acid, sodium alginate, polyethylene glycol), polyoxyethylene oxide, 
carboxyvinyl polymer and polyvinyl alcohol). The amount of sequestering agent 
useful herein is 0.5-20 wt%, preferably 1-10 wt% based on total formulation weight, 
which represents the amount necessary to prevent desorbtion of the protein from the 
polymer matrix and to provide appropriate handling of the composition, yet not so 
much that the progenitor cells are prevented from infiltrating the matrix, thereby 
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5 providing the protein the opportunity to assist the osteogenic activity of the progenitor 
cells. 

In further compositions, proteins of the invention may be combined with other 
agents beneficial to the treatment of the bone and/or cartilage defect, wound, or tissue 
in question. These agents include various growth factors such as epidermal growth 
10 factor (EGF), platelet derived growth factor (PDGF), transforming growth factors 
(TGF-a and TGF-P), and insulin-like growth factor (IGF). 

The therapeutic compositions are also presently valuable for veterinary 
applications. Particularly domestic animals and thoroughbred horses, in addition to 
humans, are desired patients for such treatment with proteins of the present invention. 

1 5 The dosage regimen of a protein-containing pharmaceutical composition to be 

used in tissue regeneration will be determined by the attending physician considering 
various factors which modify the action of the proteins, e.g., amount of tissue weight 
desired to be formed, the site of damage, the condition of the damaged tissue, the size 
of a wound, type of damaged tissue (e.g., bone), the patient's age, sex, and diet, the 

20 severity of any infection, time of administration and other clinical factors. The dosage 
may vary with the type of matrix used in the reconstitution and with inclusion of other 
proteins in the pharmaceutical composition. For example, the addition of other known 
growth factors, such as IGF I (insulin like growth factor I), to the final composition, 
may also effect the dosage. Progress can be monitored by periodic assessment of 

25 tissue/bone growth and/or repair, for example, X-rays, histomorphometric 
determinations and tetracycline labeling. 

Polynucleotides of the present invention can also be used for gene therapy. 
Such polynucleotides can be introduced either in vivo or ex vivo into cells for 
expression in a mammalian subject. Polynucleotides of the invention may also be 

30 administered by other known methods for introduction of nucleic acid into a cell or 
organism (including, without limitation, in the form of viral vectors or naked DNA). 
Cells may also be cultured ex vivo in the presence of proteins of the present 
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5 Patent and literature references cited herein are incorporated by reference 

if fully set forth. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Jacobs, Kenneth 
McCoy, John 
Kelleher, Kerry- 
Carl in, McKeough 

(ii) TITLE OF INVENTION: DNA SEQUENCES AND SECRETED PROTEINS 
ENCODED THEREBY 

Ciii) NUMBER OF SEQUENCES: 12 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genetics Institute, Inc. -- Legal Affairs 

(B) STREET: 87 Cambridge Park Drive 

(C) CITY: Cambridge 

(D) STATE: Massachusetts 
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(F) ZIP: 02140 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Brown, Scott A. 

(B) REGISTRATION NUMBER: 32,724 
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(A) TELEPHONE: (617) 498-8224 

(B) TELEFAX: (617) 876-5851 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2209 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GAGAAGATAA AACTGGACAC TGGGGAGACA CAACTTC ATG CTG CGT GGG ATC TCC 55 

Met Leu Arg Gly lie Ser 
1 5 

CAG CTA CCT GCA GTG GCC ACC ATG TCT TGG GTC CTG CTG CCT GTA CTT 103 
Gin Leu Pro Ala Val Ala Thr Met Ser Trp Val Leu Leu Pro Val Leu 
10 15 20 

TGG CTC ATT GTT CAA ACT CAA GCA ATA GCC ATA AAG CAA ACA CCT GAA 151 
Trp Leu He Val Gin Thr Gin Ala He Ala He Lys Gin Thr Pro Glu 
25 30 35 

TTA ACG CTC CAT GAA ATA GTT TGT CCT AAA AAA CTT CAC ATT TTA CAC 19 9 

Leu Thr Leu His Glu He Val Cys Pro Lys Lys Leu His He Leu His 
40 45 50 

AAA AGA GAG ATC AAG AAC AAC CAG ACA GAA AAG CAT GGC AAA GAG GAA 247 
Lys Arg Glu He Lys Asn Asn Gin Thr Glu Lys His Gly Lys Glu Glu 
55 60 65 70 

AGG TAT GAA CCT GAA GTT CAA TAT CAG ATG ATC TTA AAT GGA GAA GAA 29 5 

Arg Tyr Glu Pro Glu Val Gin Tyr Gin Met He Leu Asn Gly Glu Glu 
75 80 85 

ATC ATT CTC TCC CTA CAA AAA ACC AAG CAC CTC CTG GGG CCA GAC TAC 3 43 

He He Leu Ser Leu Gin Lys Thr Lys His Leu Leu Gly Pro Asp Tyr 
90 95 100 

ACT GAA ACA TTG TAC TCA CCC AGA GGA GAG GAA ATT ACC ACG AAA CCT 3 91 

Thr Glu Thr Leu Tyr Ser Pro Arg Gly Glu Glu He Thr Thr Lys Pro 
105 HO H5 

GAG AAC ATG GAA CAC TGT TAC TAT AAA GGA AAC ATC CTA AAT GAA AAG 43 9 

Glu Asn Met Glu His Cys Tyr Tyr Lys Gly Asn He Leu Asn Glu Lys 
120 125 130 

AAT TCT GTT GCC AGC ATC AGT ACT TGT GAC GGG TTG AGA GGA TAC TTC 4 87 

Asn Ser Val Ala Ser He Ser Thr Cys Asp Gly Leu Arg Gly Tyr Phe 
i35 140 145 150 

ACA CAT CAT CAC CAA AGA TAC CAG ATA AAA CCT CTG AAA AGC ACA GAC 53 5 

Thr His His His Gin Arg Tyr Gin He Lys Pro Leu Lys Ser Thr Asp 
155 160 165 

GAG AAA GAA CAT GCC GTC TTT ACA TCT AAC CAG GAG GAA CAA GAC CCA 583 
Glu Lys Glu His Ala Val Phe Thr Ser Asn Gin Glu Glu Gin Asp Pro 
170 175 180 

GCT AAC CAC ACA TGT GGT GTG AAG AGC ACT GAC GGG AAA CAA GGC CCA 631 
Ala Asn His Thr Cys Gly Val Lys Ser Thr Asp Gly Lys Gin Gly Pro 
185 190 195 

ATT CGA ATC TCT AGA TCA CTC AAA AGC CCA GAG AAA GAA GAC TTT CTT 679 
He Arg He Ser Arg Ser Leu Lys Ser Pro Glu Lys Glu Asp Phe Leu 
200 205 210 

CGG GCA CAG AAA TAC ATT GAT CTC TAT TTG GTG CTG GAT AAT GCC TTT 72 7 

Arg Ala Gin Lys Tyr He Asp Leu Tyr Leu Val Leu Asp Asn Ala Phe 
215 220 225 230 

TAT AAG AAC TAT AAT GAG AAT CTA ACT CTG ATA AGA AGC TTT GTG TTT 77 5 
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Tyr Lys Asn Tyr Asn Glu Asn Leu Thr Leu lie Arg Ser Phe Val Phe 
235 240 245 
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GAT GTG ATG AAC CTA CTC AAT GTG ATA TAT AAC ACC ATA GAT GTT CAA 
Asp Val Met Asn Leu Leu Asn Val He Tyr Asn Thr He Asp Val Gin 
250 255 260 



823 



GTG GCC TTG GTA GGT ATG GAA ATC TGG TCT GAT GGG GAT AAG ATA AAG 871 
Val Ala Leu Val Gly Met Glu He Trp Ser Asp Gly Asp Lys He Lys 
265 270 275 

GTG GTG CCC AGC GCA AGC ACC ACG TTT GAC AAC TTC CTG AGA TGG CAC 919 
Val Val Pro Ser Ala Ser Thr Thr Phe Asp Asn Phe Leu Arg Trp His 
280 285 290 

AGT TCT AAC CTG GGG AAA AAG ATC CAC GAC CAT GCT CAG CTT CTC AGC 967 
Ser Ser Asn Leu Gly Lys Lys He His Asp His Ala Gin Leu Leu Ser 
295 300 305 310 

GGG ATT AGC TTC AAC AAT CGA CGT GTG GGA CTG GCA GCT TCA AAT TCC 1015 
Gly He Ser Phe Asn Asn Arg Arg Val Gly Leu Ala Ala Ser Asn Ser 
315 320 325 

TTG TGT TCC CCA TCT TCG GTT GCT GTT ATT GAG GCT AAA AAA AAG AAT 1063 
Leu Cys Ser Pro Ser Ser Val Ala Val He Glu Ala Lys Lys Lys Asn 
330 335 340 

AAT GTG GCT CTT GTA GGA GTG ATG TCA CAT GAG CTG GGC CAT GTC CTT 1111 
Asn Val Ala Leu Val Gly Val Met Ser His Glu Leu Gly His Val Leu 
345 350 355 

GGT ATG CCT GAT GTT CCA TTC AAC ACC AAG TGT CCC TCT GGC AGT TGT 1159 
Gly Met Pro Asp Val Pro Phe Asn Thr Lys Cys Pro Ser Gly Ser Cys 
360 365 370 

GTG ATG AAT CAG TAT CTG AGT TCA AAA TTC CCA AAG GAT TTC AGT ACA 1207 
Val Met Asn Gin Tyr Leu Ser Ser Lys Phe Pro Lys Asp Phe Ser Thr 
375 380 385 390 

TCT TGC CGT GCA CAT TTT GAA AGA TAC CTT TTA TCT CAG AAA CCA AAG 1255 
Ser Cys Arg Ala His Phe Glu Arg Tyr Leu Leu Ser Gin Lys Pro Lys 
395 400 405 

TGC CTG CTG CAA GCA CCT ATT CCT ACA AAT ATA ATG ACA ACA CCA GTG 13 03 

Cys Leu Leu Gin Ala Pro He Pro Thr Asn He Met Thr Thr Pro Val 
410 415 420 

TGT GGG AAC CAC CTT CTA GAA GTG GGA GAA GAC TGT GAT TGT GGC TCT 13 51 

Cys Gly Asn His Leu Leu Glu Val Gly Glu Asp Cys Asp Cys Gly Ser 
425 430 435 

CCT AAG GAG TGT ACC AAT CTC TGC TGT GAA GCC CTA ACG TGT AAA CTG 13 99 

Pro Lys Glu Cys Thr Asn Leu Cys Cys Glu Ala Leu Thr Cys Lys Leu 
440 445 450 

AAG CCT GGA ACT GAT TGC GGA GGA GAT GCT CCA AAC CAT ACC ACA GAG 1447 
Lys Pro Gly Thr Asp Cys Gly Gly Asp Ala Pro Asn His Thr Thr Glu 
455 460 465 470 



TGAATCCAAA AGTCTGCTTC ACTGAGATGC TACCTTGCCA GGACAAGAAC CAAGAACTCT 1507 

AACTGTCCCA GGAATCTTGT GAATTTTCAC CCATAATGGT CTTTCACTTG TCATTCTACT 1567 

TTCTATATTG TTATCAGTCC AGGAAACAGG TAAACAGATG TAATTAGAGA CATTGGCTCT 1627 

TTGTTTAGGC CTAATCTTTC TTTTTACTTT TTTTTTTCTT TTTTCTTTTT TTTTAAAGAT 1687 
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CATGAATTTG TGACTTAGTT CTGCCCTTTG GAGAACAAAA GAAAGCAGTC TTCCATCAAA 1747 

TCACCTTAAA ATGCACGGCT AAACTATTCA GAGTTAACAC TCCAGAATTG TTAAATTACA 18 07 

AGTACTATGC TTTAATGCTT CTTTCATCTT ACTAGTATGG CCTATAAAAA AAATAATACC 1867 

ACTTGATGGG TGAAGGCTTT GGCAATAGAA AGAAGAATAG AATTCAGGTT TTATGTTATT 1927 

CCTCTGTGTT CACTTCGCCT TGCTCTTGAA AGTGCAGTAT TTTTCTACAT CATGTCGAGA 1987 

ATGATTCAAT GTAAATATTT TTCATTTTAT CATGTATATC C TAT AC AC AC ATCTCCTTCA 2 047 

TCATCATATA TGAAGTTTAT TTTGAGAAGT CTACATTGCT TACATTTTAA TTGAGCCAGC 2107 

AAAGAAGGCT TAATGATTTA TTGAACCATA ATGTCAATAA AAACACAACT TTTGAGGCAA 2167 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AA 2209 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Leu Arg Gly lie Ser Gin Leu Pro Ala Val Ala Thr Met Ser Trp 
15 10 15 

Val Leu Leu Pro Val Leu Trp Leu He Val Gin Thr Gin Ala He Ala 
20 25 30 

He Lys Gin Thr Pro Glu Leu Thr Leu His Glu He Val Cys Pro Lys 
35 40 45 

Lys Leu His He Leu His Lys Arg Glu He Lys Asn Asn Gin Thr Glu 
50 55 60 

Lys His Gly Lys Glu Glu Arg Tyr Glu Pro Glu Val Gin Tyr Gin Met 

65 70 75 80 

He Leu Asn Gly Glu Glu He He Leu Ser Leu Gin Lys Thr Lys His 
85 90 95 

Leu Leu Gly Pro Asp Tyr Thr Glu Thr Leu Tyr Ser Pro Arg Gly Glu 
100 105 110 

Glu He Thr Thr Lys Pro Glu Asn Met Glu His Cys Tyr Tyr Lys Gly 
115 120 125 

Asn He Leu Asn Glu Lys Asn Ser Val Ala Ser He Ser Thr Cys Asp 
130 135 140 
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Gin Glu Glu Gin Asp Pro Ala Asn His Thr Cys Gly Val Lys Ser Thr 
180 185 190 

Asp Gly Lys Gin Gly Pro He Arg He Ser Arg Ser Leu Lys Ser Pro 
195 200 205 

Glu Lys Glu Asp Phe Leu Arg Ala Gin Lys Tyr He Asp Leu Tyr Leu 
210 215 220 

Val Leu Asp Asn Ala Phe Tyr Lys Asn Tyr Asn Glu Asn Leu Thr Leu 
225 230 235 240 

He Arg Ser Phe Val Phe Asp Val Met Asn Leu Leu Asn Val He Tyr 
245 250 255 

Asn Thr He Asp Val Gin Val Ala Leu Val Gly Met Glu He Trp Ser 
260 265 270 

Asp Gly Asp Lys He Lys Val Val Pro Ser Ala Ser Thr Thr Phe Asp 
275 280 285 

Asn Phe Leu Arg Trp His Ser Ser Asn Leu Gly Lys Lys He His Asp 
290 295 300 

His Ala Gin Leu Leu Ser Gly He Ser Phe Asn Asn Arg Arg Val Gly 
305 310 315 320 

Leu Ala Ala Ser Asn Ser Leu Cys Ser Pro Ser Ser Val Ala Val He 
325 330 335 

Glu Ala Lys Lys Lys Asn Asn Val Ala Leu Val Gly Val Met Ser His 
340 345 350 

Glu Leu Gly His Val Leu Gly Met Pro Asp Val Pro Phe Asn Thr Lys 
355 360 365 

Cys Pro Ser Gly Ser Cys Val Met Asn Gin Tyr Leu Ser Ser Lys Phe 
370 375 380 

Pro Lys Asp Phe Ser Thr Ser Cys Arg Ala His Phe Glu Arg Tyr Leu 
385 390 395 400 

Leu Ser Gin Lys Pro Lys Cys Leu Leu Gin Ala Pro He Pro Thr Asn 
405 410 415 

He Met Thr Thr Pro Val Cys Gly Asn His Leu Leu Glu Val Gly Glu 
420 425 430 

Asp Cys Asp Cys Gly Ser Pro Lys Glu Cys Thr Asn Leu Cys Cys Glu 
435 440 445 

Ala Leu Thr Cys Lys Leu Lys Pro Gly Thr Asp Cys Gly Gly Asp Ala 
450 455 460 

Pro Asn His Thr Thr Glu 
465 470 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 582 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 52.. 2034 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

ATTTCTCAGC TCCAAGCATT AGGTAAACCC ACCAAGCAAT CCTAGCCTGT G ATG GCG 57 

Met Ala 
1 

TTT GAC GTC AGC TGC TTC TTT TGG GTG GTG CTG TTT TCT GCC GGC TGT 105 
Phe Asp Val Ser Cys Phe Phe Trp Val Val Leu Phe Ser Ala Gly Cys 
5 10 15 

AAA GTC ATC ACC TCC TGG GAT CAG ATG TGC ATT GAG AAA GAA GCC AAC 153 
Lys Val lie Thr Ser Trp Asp Gin Met Cys lie Glu Lys Glu Ala Asn 
20 25 30 

AAA ACA TAT AAC TGT GAA AAT TTA GGT CTC AGT GAA ATC CCT GAC ACT 201 
Lys Thr Tyr Asn Cys Glu Asn Leu Gly Leu Ser Glu lie Pro Asp Thr 
35 40 45 50 

CTA CCA AAC ACA ACA GAA TTT TTG GAA TTC AGC TTT AAT TTT TTG CCT 249 
Leu Pro Asn Thr Thr Glu Phe Leu Glu Phe Ser Phe Asn Phe Leu Pro 
55 60 65 

ACA ATT CAC AAT AGA ACC TTC AGC AGA CTC ATG AAT CTT ACC TTT TTG 297 
Thr lie His Asn Arg Thr Phe Ser Arg Leu Met Asn Leu Thr Phe Leu 
70 75 80 

GAT TTA ACT AGG TGC CAG ATT AAC TGG ATA CAT GAA GAC ACT TTT CAA 345 
Asp Leu Thr Arg Cys Gin lie Asn Trp lie His Glu Asp Thr Phe Gin 
85 90 95 

AGC CAT CAT CAA TTA AGC ACA CTT GTG TTA ACT GGA AAT CCC CTG ATA 3 93 

Ser His His Gin Leu Ser Thr Leu Val Leu Thr Gly Asn Pro Leu He 
100 105 110 

TTC ATG GCA GAA ACA TCG CTT AAT GGG CCC AAG TCA CTG AAG CAT CTT 441 
Phe Met Ala Glu Thr Ser Leu Asn Gly Pro Lys Ser Leu Lys His Leu 
115 120 125 130 

TTC TTA ATC CAA ACG GGA ATA TCC AAT CTC GAG TTT ATT CCA GTG CAC 489 
Phe Leu He Gin Thr Gly He Ser Asn Leu Glu Phe He Pro Val His 
135 140 145 

AAT CTG GAA AAC TTG GAA AGC TTG TAT CTT GGA AGC AAC CAT ATT TCC 537 
Asn Leu Glu Asn Leu Glu Ser Leu Tyr Leu Gly Ser Asn His He Ser 

1^0 155 160 



GAT TTT CAG AAT AAT GCT ATA CAC TAC ATC TCT AGA GAA GAC ATG AGG 63 3 
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Asp Phe Gin Asn Asn Ala lie His Tyr He Ser Arg Glu Asp Met Arq 
180 185 190 

TCT CTG GAG CAG GCC ATC AAC CTA AGC CTG AAC TTC AAT GGC AAT AAT 681 
Ser Leu Glu Gin Ala He Asn Leu Ser Leu Asn Phe Asn Gly Asn Asn 
195 200 205 210 

GTT AAA GGT ATT GAG CTT GGG GCT TTT GAT TCA ACG GTC TTC CAA AGT 729 
Val Lys Gly He Glu Leu Gly Ala Phe Asp Ser Thr Val Phe Gin Ser 
215 220 225 

TTG AAC TTT GGA GGA ACT CCA AAT TTG TCT GTT ATA TTC AAT GGT CTG 777 
Leu Asn Phe Gly Gly Thr Pro Asn Leu Ser Val He Phe Asn Gly Leu 
230 235 240 

CAG AAC TCT ACT ACT CAG TCT CTC TGG CTG GGA AC A TTT GAG GAC ATT 825 
Gin Asn Ser Thr Thr Gin Ser Leu Trp Leu Gly Thr Phe Glu Asp He 
245 250 255 

GAT GAC GAA GAT ATT AGT TCA GCC ATG CTC AAG GGA CTC TGT GAA ATG 873 
Asp Asp Glu Asp He Ser Ser Ala Met Leu Lys Gly Leu Cys Glu Met 
260 265 270 

TCT GTT GAG AGC CTC AAC CTG CAG GAA CAC CGC TTC TCT GAC ATC TCA 921 
Ser Val Glu Ser Leu Asn Leu Gin Glu His Arg Phe Ser Asp He Ser 
275 280 285 290 

TCC ACC ACA TTT CAG TGC TTC ACC CAA CTC CAA GAA TTG GAT CTG ACA 969 
Ser Thr Thr Phe Gin Cys Phe Thr Gin Leu Gin Glu Leu Asp Leu Thr 
295 300 305 

GCA ACT CAC TTG AAA GGG TTA CCC TCT GGG ATG AAG GGT CTG AAC TTG 1017 
Ala Thr His Leu Lys Gly Leu Pro Ser Gly Met Lys Gly Leu Asn Leu 
310 315 320 

CTC AAG AAA TTA GTT CTC AGT GTA AAT CAT TTC GAT CAA TTG TGT CAA 1065 
Leu Lys Lys Leu Val Leu Ser Val Asn His Phe Asp Gin Leu Cys Gin 
325 330 335 

ATC AGT GCT GCC AAT TTC CCC TCC CTT ACA CAC CTC TAC ATC AGA GGC 1113 
He Ser Ala Ala Asn Phe Pro Ser Leu Thr His Leu Tyr He Arg Gly 
340 345 350 

AAC GTG AAG AAA CTT CAC CTT GGT GTT GGC TGC TTG GAG AAA CTA GGA 1161 
Asn Val Lys Lys Leu His Leu Gly Val Gly Cys Leu Glu Lys Leu Gly 
355 360 365 370 

AAC CTT CAG ACA CTT GAT TTA AGC CAT AAT GAC ATA GAG GCT TCT GAC 12 09 

Asn Leu Gin Thr Leu Asp Leu Ser His Asn Asp He Glu Ala Ser Asp 
375 380 385 

TGC TGC AGT CTG CAA CTC AAA AAC CTG TCC CAC TTG CAA ACC TTA AAC 1257 
Cys Cys Ser Leu Gin Leu Lys Asn Leu Ser His Leu Gin Thr Leu Asn 
390 395 400 

CTG AGC CAC AAT GAG CCT CTT GGT CTC CAG AGT CAG GCA TTC AAA GAA 13 05 

Leu Ser His Asn Glu Pro Leu Gly Leu Gin Ser Gin Ala Phe Lys Glu 
405 410 415 

TGT CCT CAG CTA GAA CTC CTC GAT TTG GCA TTT ACC CGC TTA CAC ATT 13 53 

Cys Pro Gin Leu Glu Leu Leu Asp Leu Ala Phe Thr Arg Leu His He 
420 425 430 
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AAT GCT CCA CAA AGT CCC TTC CAA AAC CTC CAT TTC CTT CAG GTT CTG 14 01 

Asn Ala Pro Gin Ser Pro Phe Gin Asn Leu His Phe Leu Gin Val Leu 
435 440 445 450 

AAT CTC ACT TAC TGC TTC CTT GAT ACC AGC AAT CAG CAT CTT CTA GCA 1449 
Asn Leu Thr Tyr Cys Phe Leu Asp Thr Ser Asn Gin His Leu Leu Ala 
455 460 465 

GGC CTA CCA GTT CTC CGG CAT CTC AAC TTA AAA GGG AAT CAC TTT CAA 1497 
Gly Leu Pro Val Leu Arg His Leu Asn Leu Lys Gly Asn His Phe Gin 
470 475 480 

GAT GGG ACT ATC ACG AAG ACC AAC CTA CTT CAG ACC GTG GGC AGC TTG 1545 
Asp Gly Thr lie Thr Lys Thr Asn Leu Leu Gin Thr Val Gly Ser Leu 
485 490 495 

GAG GTT CTG ATT TTG TCC TCT TGT GGT CTC CTC TCT ATA GAC CAG CAA 1593 
Glu Val Leu lie Leu Ser Ser Cys Gly Leu Leu Ser lie Asp Gin Gin 
500 505 510 

GCA TTC CAC AGC TTG GGA AAA ATG AGC CAT GTA GAC TTA AGC CAC AAC 1641 
Ala Phe His Ser Leu Gly Lys Met Ser His Val Asp Leu Ser His Asn 
515 520 525 530 

AGC CTG ACA TGC GAC AGC ATT GAT TCT CTT AGC CAT CTT AAG GGA ATC 1689 
Ser Leu Thr Cys Asp Ser lie Asp Ser Leu Ser His Leu Lys Gly lie 
535 540 545 

TAC CTC AAT CTG GCT GCC AAC AGC ATT AAC ATC ATC TCA CCC CGT CTC 1737 
Tyr Leu Asn Leu Ala Ala Asn Ser lie Asn lie lie Ser Pro Arg Leu 
550 555 560 

CTC CCT ATC TTG TCC CAG CAG AGC ACC ATT AAT TTA AGT CAT AAC CCC 17 85 

Leu Pro lie Leu Ser Gin Gin Ser Thr lie Asn Leu Ser His Asn Pro 
565 570 575 

CTG GAC TGC ACT TGC TCG AAT ATT CAT TTC TTA ACA TGG TAC AAA GAA 1833 
Leu Asp Cys Thr Cys Ser Asn lie His Phe Leu Thr Trp Tyr Lys Glu 
580 585 590 

AAC CTG CAC AAA CTT GAA GGC TCG GAG GAG ACC ACG TGT GCA AAC CCG 1881 
Asn Leu His Lys Leu Glu Gly Ser Glu Glu Thr Thr Cys Ala Asn Pro 
595 600 605 610 

CCA TCT CTA AGG GGA GTT AAG CTA TCT GAT GTC AAG CTT TCC TGT GGG 1929 
Pro Ser Leu Arg Gly Val Lys Leu Ser Asp Val Lys Leu Ser Cys Gly 
615 620 625 

ATT ACA GCC ATA GGC ATT TTC TTT CTC ATA GTA TTT CTA TTA TTG TTG 1977 
He Thr Ala He Gly lie Phe Phe Leu He Val Phe Leu Leu Leu Leu 
630 635 640 

GCT ATT CTG CTA TTT TTT GCA GTT AAA TAC CTT CTC AGG TGG AAA TAC 2 025 

Ala He Leu Leu Phe Phe Ala Val Lys Tyr Leu Leu Arg Trp Lys Tyr 
645 650 655 

^ A A CAC ATT TAGTGCTGAA GGTTTCCAGA GAAAGCAAAT AAGTGTGCTT 2 074 
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ATTGCTTCCT GGAACTGGGC AGGGACTCAC TGTGCTTTTC TGAGCTTCTT ACTCCTGTGA 2194 
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GTCCCAGAGC TAAAGAACCT TCTAGGCAAG TACACCGAAT GACTCAGTCC AGAGGGTCAG 22 54 

ATGCTGCTGT GAGAGGCACA GAGCCCTTTC CGCATGTGGA AGAGTGGGAG GAAGCAGAGG 2314 

GAGGGACTGG GCAGGGACTG CCGGCCCCGG AGTCTCCCAC AGGGAGGCCA TTCCCCTTCT 23 74 

ACTCACCGAC ATCCCTCCCA GCACCACACA CCCCGCCCCT GAAAGGAGAT CATCAGCCCC 2 434 

CACAATTTGT CAGAGCTGAA GCCAGCCCAC TACCCACCCC CACTACAGCA TTGTGCTTGG 2 494 

GTCTGGGTTC TCAGTAATGT AGCCATTTGA GAAACTTACT TGGGGACAAA GTCTCAATCC 2 554 

TTATTTTAAA TGAAAAAAAA AAAAAAAA 25 82 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 661 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Ala Phe Asp Val Ser Cys Phe Phe Trp Val Val Leu Phe Ser Ala 
1 5 -10 15 

Gly Cys Lys Val He Thr Ser Trp Asp Gin Met Cys He Glu Lys Glu 
20 25 30 

Ala Asn Lys Thr Tyr Asn Cys Glu Asn Leu Gly Leu Ser Glu He Pro 
35 40 45 

Asp Thr Leu Pro Asn Thr Thr Glu Phe Leu Glu Phe Ser Phe Asn Phe 
50 55 60 

Leu Pro Thr He His Asn Arg Thr Phe Ser Arg Leu Met Asn Leu Thr 
65 70 75 80 

Phe Leu Asp Leu Thr Arg Cys Gin He Asn Trp He His Glu Asp Thr 
85 90 95 

Phe Gin Ser His His Gin Leu Ser Thr Leu Val Leu Thr Gly Asn Pro 
100 105 HO 

Leu He Phe Met Ala Glu Thr Ser Leu Asn Gly Pro Lys Ser Leu Lys 
115 120 125 

His Leu Phe Leu He Gin Thr Gly He Ser Asn Leu Glu Phe He Pro 
130 135 i4 0 

Val His Asn Leu Glu Asn Leu Glu Ser Leu Tyr Leu Gly Ser Asn His 
14 5 150 155 160 

He Ser Ser He Lys Phe Pro Lys Asp Phe Pro Ala Arg Asn Leu Lys 
165 170 175 

Val Leu Asp Phe Gin Asn Asn Ala He His Tyr He Ser Arg Glu Asp 
180 185 190 

Met Arg Ser Leu Glu Gin Ala He Asn Leu Ser Leu Asn Phe Asn Gly 
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195 200 205 

Asn Asn Val Lys Gly lie Glu Leu Gly Ala Phe Asp Ser Thr Val Phe 
210 215 220 

Gin Ser Leu Asn Phe Gly Gly Thr Pro Asn Leu Ser Val lie Phe Asn 
225 230 235 240 

Gly Leu Gin Asn Ser Thr Thr Gin Ser Leu Trp Leu Gly Thr Phe Glu 
245 250 255 

Asp lie Asp Asp Glu Asp lie Ser Ser Ala Met Leu Lys Gly Leu Cys 
260 265 270 

Glu Met Ser Val Glu Ser Leu Asn Leu Gin Glu His Arg Phe Ser Asp 
275 280 285 

He Ser Ser Thr Thr Phe Gin Cys Phe Thr Gin Leu Gin Glu Leu Asp 
290 295 300 

Leu Thr Ala Thr His Leu Lys Gly Leu Pro Ser Gly Met Lys Gly Leu 
305 310 315 320 

Asn Leu Leu Lys Lys Leu Val Leu Ser Val Asn His Phe Asp Gin Leu 
325 330 335 

Cys Gin He Ser Ala Ala Asn Phe Pro Ser Leu Thr His Leu Tyr He 
340 345 350 

Arg Gly Asn Val Lys Lys Leu His Leu Gly Val Gly Cys Leu Glu Lys 
355 360 365 

Leu Gly Asn Leu Gin Thr Leu Asp Leu Ser His Asn Asp He Glu Ala 
370 375 380 

Ser Asp Cys Cys Ser Leu Gin Leu Lys Asn Leu Ser His Leu Gin Thr 
385 390 395 400 

Leu Asn Leu Ser His Asn Glu Pro Leu Gly Leu Gin Ser Gin Ala Phe 
405 410 415 

Lys Glu Cys Pro Gin Leu Glu Leu Leu Asp Leu Ala Phe Thr Arg Leu 
420 425 430 

His He Asn Ala Pro Gin Ser Pro Phe Gin Asn Leu His Phe Leu Gin 
435 440 445 

Val Leu Asn Leu Thr Tyr Cys Phe Leu Asp Thr Ser Asn Gin His Leu 
450 455 460 

Leu Ala Gly Leu Pro Val Leu Arg His Leu Asn Leu Lys Gly Asn His 
465 470 475 480 

Phe Gin Asp Gly Thr He Thr Lys Thr Asn Leu Leu Gin Thr Val Gly 
485 490 495 

Ser Leu Glu Val Leu He Leu Ser Ser Cys Gly Leu Leu Ser He Asp 



His Asn Ser Leu Thr Cys Asp Ser lie Asp Ser Leu Ser His Leu Lys 
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530 535 540 

Gly He Tyr Leu Asn Leu Ala Ala Asn Ser He Asn He He Ser Pro 
545 550 555 560 

Arg Leu Leu Pro He Leu Ser Gin Gin Ser Thr He Asn Leu Ser His 
565 570 575 

Asn Pro Leu Asp Cys Thr Cys Ser Asn He His Phe Leu Thr Trp Tyr 
580 585 590 

Lys Glu Asn Leu His Lys Leu Glu Gly Ser Glu Glu Thr Thr Cys Ala 
595 600 605 

Asn Pro Pro Ser Leu Arg Gly Val Lys Leu Ser Asp Val Lys Leu Ser 
610 615 620 

Cys Gly He Thr Ala He Gly He Phe Phe Leu He Val Phe Leu Leu 
625 630 635 640 

Leu Leu Ala He Leu Leu Phe Phe Ala Val Lys Tyr Leu Leu Arg Trp 
645 650 655 

Lys Tyr Gin His He 
660 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 588 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(ix) FEATURE : 

(A) NAME /KEY: CDS 

(B) LOCATION: 76. .474 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CGGCCAAAGA GGCCTAAACT TGCGGCTGTC CATCTCACCT ACAGCTCTGG TCTCATCCTC 6 0 

AACTCAACCA CAATC ATG GCT CAG ATG ATG ACT CTG AGC CTC CTT AGC CTG 111 
Met Ala Gin Met Met Thr Leu Ser Leu Leu Ser Leu 
15 10 

GTC CTG GCT CTC TGC ATC CCC TGG ACC CAA GGC AGT GAT GGA GGG GGT 159 
Val Leu Ala Leu Cys He Pro Trp Thr Gin Gly Ser Asp Gly Gly Gly 
15 20 25 

CAG GAC TGC TGC CTT AAG TAC AGC CAG AAG AAA ATT CCC TAC AGT ATT 207 
Gin Asp Cys Cys Leu Lys Tyr Ser Gin Lys Lys He Pro Tyr Ser He 
30 35 40 

GTC CGA GGC TAT AGG AAG CAA GAA CCA AGT TTA GGC TGT CCC ATC CCG 255 
Val Arg Gly Tyr Arg Lys Gin Glu Pro Ser Leu Gly Cys Pro He Pro 
45 50 55 60 
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GCA ATC CTG TTC TCA CCC CGG AAG CAC TCT AAG CCT GAG CTA TGT GCA 3 03 

Ala lie Leu Phe Ser Pro Arg Lys His Ser Lys Pro Glu Leu Cys Ala 
65 70 75 

AAC CCT GAG GAA GGC TGG GTG CAG AAC CTG ATG CGC CGC CTG GAC CAG 3 51 

Asn Pro Glu Glu Gly Trp Val Gin Asn Leu Met Arg Arg Leu Asp Gin 
80 85 90 

CCT CCA GCC CCA GGG AAA CAA AGC CCC GGC TGC AGG AAG AAC CGG GGA 3 99 

Pro Pro Ala Pro Gly Lys Gin Ser Pro Gly Cys Arg Lys Asn Arg Gly 
95 100 105 

ACC TCT AAG TCT GGA AAG AAA GGA AAG GGC TCC AAG GGC TGC AAG AGA 447 
Thr Ser Lys Ser Gly Lys Lys Gly Lys Gly Ser Lys Gly Cys Lys Arg 
110 115 120 

ACT GAA CAG ACA CAG CCC TCA AGA GGA TAGCCCAGTA GCCCGCCTGG 494 
Thr Glu Gin Thr Gin Pro Ser Arg Gly 
125 130 

AGCCCAGGAG ATCCCCCACG AACTTCAAGC TGGGTGGTTC ACGGTCCAAC TCACAGGCAA 554 
AGAGGGAGCT AGAAAACAGA CTCAGGAGCC GCTA 588 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 133 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

Met Ala Gin Met Met Thr Leu Ser Leu Leu Ser Leu Val Leu Ala Leu 
15 10 15 

Cys lie Pro Trp Thr Gin Gly Ser Asp Gly Gly Gly Gin Asp Cys Cys 
20 25 30 

Leu Lys Tyr Ser Gin Lys Lys lie Pro Tyr Ser lie Val Arg Gly Tyr 
35 40 45 

Arg Lys Gin Glu Pro Ser Leu Gly Cys Pro lie Pro Ala lie Leu Phe 
50 55 60 

Ser Pro Arg Lys His Ser Lys Pro Glu Leu Cys Ala Asn Pro Glu Glu 
65 70 75 80 

Gly Trp Val Gin Asn Leu Met Arg Arg Leu Asp Gin Pro Pro Ala Pro 
85 90 95 

Gly Lys Gin Ser Pro Gly Cys Arg Lys Asn Arg Gly Thr Ser Lys Ser 
100 105 110 
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(2) INFORMATION FOR SEQ ID NO; 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 966 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 67.. 348 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CTTCCAAGAA GAGCAGCAAA GCTGAAGTAG CAGCAACAGC ACCAGCAGCA ACAGCAAAAA 60 

ACAAAC ATG AGT GTG AAG GGC ATG GCT ATA GCC TTG GCT GTG ATA TTG 108 
Met Ser Val Lys Gly Met Ala He Ala Leu Ala Val He Leu 
1 5 io 

TGT GCT ACA GTT GTT CAA GGC TTC CCC ATG TTC AAA AGA GGA CGC TGT 156 
Cys Ala Thr Val Val Gin Gly Phe Pro Met Phe Lys Arg Gly Arg Cys 
15 20 25 30 

CTT TGC ATA GGC CCT GGG GTA AAA GCA GTG AAA GTG GCA GAT ATT GAG 2 04 

Leu Cys He Gly Pro Gly Val Lys Ala Val Lys Val Ala Asp He Glu 
35 40 45 

AAA GCC TCC ATA ATG TAC CCA AGT AAC AAC TGT GAC AAA ATA GAA GTG 252 
Lys Ala Ser He Met Tyr Pro Ser Asn Asn Cys Asp Lys He Glu Val 
50 55 60 

ATT ATT ACC CTG AAA GAA AAT AAA GGA CAA CGA TGC CTA AAT CCC AAA 3 00 

He He Thr Leu Lys Glu Asn Lys Gly Gin Arg Cys Leu Asn Pro Lys 
65 70 75 

TCG AAG CAA GCA AGG CTT ATA ATC AAA AAA GTT GAA AGA AAG AAT TTT 348 
Ser Lys Gin Ala Arg Leu He He Lys Lys Val Glu Arg Lys Asn Phe 
80 85 90 

TAAAAATATC AAAACATATG AAGTCCTGGA AAAGGGCATC TGAAAAACCT AGAACAAGTT 408 

TAACTGTGAC TACTGAAATG ACAAGAATTC TACAGTAGGA AACTGAGACT TTTCTATGGT 46 8 

TTTGTGACTT TCAACTTTTG TACAGTTATG TGAAGGATGA AAGGTGGGTG AAAGGACCAA 52 8 

AAACAGAAAT ACAGTCTTCC TGAATGAATG ACAATCAGAA TTCCACTGCC CAAAGGAGTC 588 

CAACAATTAA ATGGATTTCT AGGAAAAGCT ACCTTAAGAA AGGCTGGTTA CCATCGGAGT 64 8 

TTACAAAGTG CTTTCACGTT CTTACTTGTT GTATTATACA TTCATGCATT TCTAGGCTAG 708 

AGAACCTTCT AGATTTGATG CTTACAACTA TTCTGTTGTG ACTATGAGAA CATTTCTGTC 768 

TCTAGAAGTT ATCTGTCTGT ATTGATCTTT ATGCTATATT ACTATCTGTG GTTACAGTGG 828 

AGACATTGAC ATTATTACTG GAGTCAAGCC CTTATAAGTC AAAAGCACCT ATGTGTCGTA 8 88 
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AAGCATTCCT CAAACATTTA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 948 
AAAAAAAAAA AAAAAAAA 966 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Met Ser Val Lys Gly Met Ala He Ala Leu Ala Val He Leu Cys Ala 
15 10 15 

Thr Val Val Gin Gly Phe Pro Met Phe Lys Arg Gly Arg Cys Leu Cys 
20 25 30 

He Gly Pro Gly Val Lys Ala Val Lys Val Ala Asp He Glu Lys Ala 
35 40 45 

Ser He Met Tyr Pro Ser Asn Asn Cys Asp Lys He Glu Val He He 
50 55 60 

Thr Leu Lys Glu Asn Lys Gly Gin Arg Cys Leu Asn Pro Lys Ser Lys 
65 70 75 80 

Gin Ala Arg Leu He He Lys Lys Val Glu Arg Lys Asn Phe 
85 90 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1354 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(ix) FEATURE: 

(A) NAME/ KEY : CDS 

( B) LOCATION: 75. .356 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
TTCTACTCCT TCCAAGAAGA GCAGCAAAGC TGAAGTAGCA GCAACAGCAC CAGCAGCAAC 60 

^-AAAAAA- A A AC ATG AGT GTG AAG GGC ATG GCT ATA GCC TTG GCT GTG 110 



Ja.^ Gin Giy : ] nt 
20 



Me*. 



: rife _y 
25 



WO 97/07198 



PCT/US96/12897 



CGC TGT CTT TGC ATA GGC CCT GGG GTA AAA GCA GTG AAA GTG GCA GAT 206 
Arg Cys Leu Cys He Gly Pro Gly Val Lys Ala Val Lys Val Ala Asp 
30 35 40 

ATT GAG AAA GCC TCC ATA ATG TAC CCA AGT AAC AAC TGT GAC AAA ATA 254 
He Glu Lys Ala Ser He Met Tyr Pro Ser Asn Asn Cys Asp Lys He 
45 50 55 60 

GAA GTG ATT ATT ACC CTG AAA GAA AAT AAA GGA CAA CGA TGC CTA AAT 3 02 

Glu Val He He Thr Leu Lys Glu Asn Lys Gly Gin Arg Cys Leu Asn 
65 70 75 

CCC AAA TCG AAG CAA GCA AGG CTT ATA ATC AAA AAA GTT GAA AGA AAG 3 50 

Pro Lys Ser Lys Gin Ala Arg Leu He He Lys Lys Val Glu Arg Lys 
80 85 90 

AAT TTT TAAAAATATC AAAACATATG AAGTCCTGGA AAAGGGCATC TGAAAAACCT 4 06 

Asn Phe 



AGAACAAGTT 


TAACTGTGAC 


TACTGAAATG 


ACAAGAATTC 


TACAGTAGGA 


AACTGAGACT 


466 


TTTCTATGGT 


TTTGTGACTT 


TCAACTTTTG 


TACAGTTATG 


TGAAGGATGA 


AAGGTGGGTG 


526 


AAAijVj AC CAA 


AAACAGAAAT 


ACAGTCTTCC 


TGAATGAATG 


ACAATCAGAA 


TTCCACTGCC 


586 


CAAAGGAGTC 


CAACAATTAA 


ATGGATTTCT 


AGGAAAAGCT 


ACCTTAAGAA 


AGGCTGGTTA 


646 


CCATCGGAGT 


TTACAAAGTG 


CTTTCACGTT 


CTTACTTGTT 


GTATTATACA TTCATGCATT 


706 


TCTAGGCTAG 


AGAACCTTCT 


AGATTTGATG 


CTTACAACTA 


TTCTGTTGTG 


ACTATGAGAA 


766 


CATTTCTGTC 


TCTAGAAGTT 


ATCTGTCTGT 


ATTGATCTTT 


ATGCTATATT 


ACTATCTGTG 


826 


GTTACAGTGG 


AGACATTGAC 


ATTATTACTG 


GAGTCAAGCC 


CTTATAAGTC 


AAAAGCACCT 


886 


ATGTGTCGTA 


AAGCATTCCT 


CAAACATTTT 


TTCATGCAAA 


TACACACTTC 


TTTCCCCAAA 


946 


TATCATGTAG 


CACATCAATA 


TGTAGGGAAA 


CATTCTTATG 


CATCATTTGG 


TTTGTTTTAT 


1006 


AACCAATTCA 


TTAAATGTAA 


TTCATAAAAT 


GTACTATGAA 


AAAAATTATA 


CGCTATGGGA 


1066 


TACTGGCAAC 


AGTGCACATA 


TTTCATAACC 


AAATTAGCAG 


CACCGGTCTT 


AATTTGATGT 


1126 


TTTTCAACTT 


TTATTCATTG 


AGATGTTTTG 


AAGCAATTAG 


GATATGTGTG 


TTTACTGTAC 


1186 


TTTTTGTTTT 


GATCCGTTTG 


TATAAATGAT 


AGCAATATCT 


TGGACACATT 


TGAAATACAA 


1246 


AATGTTTTTG 


TCTACCAAAG 


AAAAATGTTG 


AAAAATAAGC 


AAATGTATAC 


CTAGCAATCA 


1306 


CTTTTACTTT 


TTGTAATTCT 


GTCTCTTAGA 


AAAATACATA 


ATCTAATT 




1354 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
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Met Ser Val Lys Gly Met Ala lie Ala Leu Ala Val He Leu Cys Ala 
15 10 15 

Thr Val Val Gin Gly Phe Pro Met Phe Lys Arg Gly Arg Cys Leu Cys 
20 25 30 

He Gly Pro Gly Val Lys Ala Val Lys Val Ala Asp He Glu Lys Ala 
35 40 45 

Ser He Met Tyr Pro Ser Asn Asn Cys Asp Lys He Glu Val He He 
50 55 60 

Thr Leu Lys Glu Asn Lys Gly Gin Arg Cys Leu Asn Pro Lys Ser Lys 
65 70 75 80 

Gin Ala Arg Leu He He Lys Lys Val Glu Arg Lys Asn Phe 
85 90 

(2) INFORMATION FOR SEQ ID NO: 11: 

H) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(ix) FEATURE; 

(A) NAME /KEY : CDS 

(B) LOCATION: 86.. 544 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GGGAAGATAC ATTCACAGAA AGAGCTTCCT GCACAAAGTA AGCCACCAGC GCAACATGAC 60 

AGTGAAGACC CTGCATGGCC CAGCC ATG GTC AAG TAC TTG CTG CTG TCG ATA 112 

Met Val Lys Tyr Leu Leu Leu Ser He 
1 5 

TTG GGG CTT GCC TTT CTG AGT GAG GCG GCA GCT CGG AAA ATC CCC AAA 16 0 

Leu Gly Leu Ala Phe Leu Ser Glu Ala Ala Ala Arg Lys He Pro Lys 
10 15 20 25 

GTA GGA CAT ACT TTT TTC CAA AAG CCT GAG AGT TGC CCG CCT GTG CCA 208 
Val Gly His Thr Phe Phe Gin Lys Pro Glu Ser Cys Pre Pro Val Pro 
30 35 40 

GGA GGT AGT ATG AAG CTT GAC ATT GGC ATC ATC AAT GAA AAC CAG CGC 256 
Gly Gly Ser Met Lys Leu Asp He Gly He He Asn Glu Asn Gin Arg 
45 50 55 

ATH 7v A COT AAC ATC GAG AGC CGC TCC ACC TCC CCC TGG AAT 3 04 
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\r Trp Asp Pro Asn Arg Tyr Pro Ser Giu Vai Vaj 
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GCC CAG TGT AGG AAC TTG GGC TGC ATC AAT GCT CAA GGA AAG GAA GAC 4 00 

Ala Gin Cys Arg Asn Leu Gly Cys He Asn Ala Gin Gly Lys Glu Asp 
90 95 100 105 

ATC TCC ATG AAT TCC GTT CCC ATC CAG CAA GAG ACC CTG GTC GTC CGG 448 
He Ser Met Asn Ser Val Pro He Gin Gin Glu Thr Leu Val Val Arg 
110 115 120 

AGG AAG CAC CAA GGC TGC TCT GTT TCT TTC CAG TTG GAG AAG GTG CTG 4 96 

Arg Lys His Gin Gly Cys Ser Val Ser Phe Gin Leu Glu Lys Val Leu 
125 130 135 

GTG ACT GTT GGC TGC ACC TGC GTC ACC CCT GTC ATC CAC CAT GTG CAG 544 
Val Thr Val Gly Cys Thr Cys Val Thr Pro Val He His His Val Gin 
140 145 150 

TAAGAGGTGC ATATCCACTC AGCTGAAGAA GCTGTAGAAA TGCCACTCCT TACCCAGTGC 6 04 

TCTGCAACAA GTCCTGTCTG ACCCCCAATT CCCTCCACTT CACAGGACTC TTAATAAGAC 6 64 

CTGCACGGAT GGAAACAGAA AATATTCACA ATGTATGTGT GTATGTACTA CACTTTATAT 724 

TTGATATCTA AAATGTTAGG AGAAAAATTA ATATATTCAG TGCTAATATA ATAAAGTATT 7 84 

AATAATTTAA AAATAAAAAA AAAAAAAAA 813 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Val Lys Tyr Leu Leu Leu Ser He Leu Gly Leu Ala Phe Leu Ser 
1 5 10 15 

Glu Ala Ala Ala Arg Lys He Pro Lys Val Gly His Thr Phe Phe Gin 
20 25 30 

Lys Pro Glu Ser Cys Pro Pro Val Pro Gly Gly Ser Met Lys Leu Asp 
35 40 45 

He Gly He He Asn Glu Asn Gin Arg Val Ser Met Ser Arg Asn He 
50 55 60 

Glu Ser Arg Ser Thr Ser Pro Trp Asn Tyr Thr Val Thr Trp Asp Pro 
65 70 75 80 

Asn Arg Tyr Pro Ser Glu Val Val Gin Ala Gin Cys Arg Asn Leu Gly 
85 90 95 

Cys He Asn Ala Gin Gly Lys Glu Asp He Ser Met Asn Ser Val Pro 
100 105 110 

He Gin Gin Glu Thr Leu Val Val Arg Arg Lys His Gin Gly Cys Ser 
115 120 125 

Val Ser Phe Gin Leu Glu Lys Val Leu Val Thr Val Gly Cys Thr Cys 



50 



WO 97/07198 



PCT/US96/12897 



130 



135 



140 



Val Thr Pro Val lie His His Val Gin 
145 150 
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What is claimed is: 



1. 



A composition comprising an isolated polynucleotide selected from 



the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ 
ID NO: 1 from nucleotide 38 to nucleotide 1447; 

(b) a polynucleotide comprising a fragment of the nucleotide 
sequence of SEQ ID NO: 1 encoding a protein having biological activity; 

(c) a polynucleotide encoding a protein comprising the amino 
acid sequence of SEQ ID NO:2; 

(d) a polynucleotide encoding a protein comprising a fragment 
of the amino acid sequence of SEQ ID NO:2 having biological activity; 

(e) a polynucleotide which is an allelic variant of SEQ ID NO: 1 ; 

and 

(f) a polynucleotide capable of hybridizing under stringent 
conditions to any one of the polynucleotides specified in (a)-(e). 



2. A composition of claim 1 wherein said polynucleotide is operably 
linked to an expression control sequence. 



3. 



A host cell transformed with a composition of claim 2. 



4. 



The host cell of claim 3, wherein said cell is a mammalian cell. 



5. A process for producing a protein, which comprises: 

(a) growing a culture of the host cell of claim 3 in a suitable 
culture medium; and 

(b) purifying the protein from the culture 



6. 



A protein produced according to the process of claim 5. 



7. A composition comprising a protein, wherein said protein comprises 
an amino acid sequence selected from the group consisting of: 
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(a) the amino acid sequence of SEQ ID NO:2; and 

(b) fragments of the amino acid sequence of SEQ ID NO:2; 
the protein being substantially free from other mammalian proteins. 

8. The composition of claim 7, further comprising a pharmaceutically 
acceptable carrier. 

9. A composition comprising an antibody which specifically reacts 
with the protein of claim 7. 

10. A method for preventing, treating or ameliorating a medical 
condition which comprises administering to a mammalian subject a therapeutically 
effective amount of a composition of claim 8. 

11. A composition comprising an isolated polynucleotide selected from 
the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ 
ID NO:3 from nucleotide 52 to nucleotide 2034; 

(b) a polynucleotide comprising a fragment of the nucleotide 
sequence of SEQ ID NO:3 encoding a protein having biological activity; 

(c) a polynucleotide encoding a protein comprising the amino 
acid sequence of SEQ ID NO:4; 

(d) a polynucleotide encoding a protein comprising a fragment 
of the amino acid sequence of SEQ ID NO:4 having biological activity; 

(e) a polynucleotide which is an allelic variant of SEQ ID NO:4; 

and 

(f) a polynucleotide capable of hybridizing under stringent 
conditions to any one of the polynucleotides specified in (a)-(e). 
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A host cell transformed with a composition of claim 12. 
The host cell of claim 13, wherein said cell is a mammalian cell. 

A process for producing a protein, which comprises: 

(a) growing a culture of the host cell of claim 13 in a suitable 
medium; and 

(b) purifying the protein from the culture 

16. A protein produced according to the process of claim 15. 

17. A composition comprising a protein, wherein said protein comprises 
an amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:4; and 

(b) fragments of the amino acid sequence of SEQ ID NO:4; 
the protein being substantially free from other mammalian proteins. 

18. The composition of claim 17, further comprising a pharmaceutical^ 
acceptable carrier. 

19. A composition comprising an antibody which specifically reacts 
with the protein of claim 17. 

20. A method for preventing, treating or ameliorating a medical 
condition which comprises administering to a mammalian subject a therapeutically 
effective amount of a composition of claim 18. 

21. A composition comprising an isolated polynucleotide selected from 
the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ 
ID NO: 5 from nucleotide 76 to nucleotide 474; 



15. 

culture 
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(b) a polynucleotide comprising a fragment of the nucleotide 
sequence of SEQ ID NO:5 encoding a protein having biological activity; 

(c) a polynucleotide encoding a protein comprising the amino 
acid sequence of SEQ ID NO:6; 

(d) a polynucleotide encoding a protein comprising a fragment 
of the amino acid sequence of SEQ ID NO:6 having biological activity; 

(e) a polynucleotide which is an allelic variant of SEQ ID NO: 5 ; 

and 

(0 a polynucleotide capable of hybridizing under stringent 
conditions to any one of the polynucleotides specified in (a)-(e). 

22. A composition of claim 21 wherein said polynucleotide is operably 
linked to an expression control sequence. 

23. A host cell transformed with a composition of claim 22. 

24. The host cell of claim 23, wherein said cell is a mammalian cell. 

25. A process for producing a protein, which comprises: 

(a) growing a culture of the host cell of claim 23 in a suitable 
culture medium; and 

(b) purifying the protein from the culture 

26. A protein produced according to the process of claim 25. 

27. A composition comprising a protein, wherein said protein comprises 
an amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:6; and 
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28. The composition of claim 27, further comprising a pharmaceutical^ 
acceptable carrier. 

29. A composition comprising an antibody which specifically reacts 
with the protein of claim 27. 

30. A method for preventing, treating or ameliorating a medical 
condition which comprises administering to a mammalian subject a therapeutically 
effective amount of a composition of claim 28. 

31. A composition comprising an isolated polynucleotide selected from 
the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ 
ID NO:7 from nucleotide 67 to nucleotide 348; 

(b) a polynucleotide comprising a fragment of the nucleotide 
sequence of SEQ ID NO:7 encoding a protein having biological activity; 

(c) a polynucleotide encoding a protein comprising the amino 
acid sequence of SEQ ID NO:8; 

(d) a polynucleotide encoding a protein comprising a fragment 
of the amino acid sequence of SEQ ID NO:8 having biological activity; 

(e) a polynucleotide which is an allelic variant of SEQ ID NO:7; 

and 

(f) a polynucleotide capable of hybridizing under stringent 
conditions to any one of the polynucleotides specified in (a)-(e). 

32. A composition of claim 3 1 wherein said polynucleotide is operably 
linked to an expression control sequence. 

33. A host cell transformed with a composition of claim 32. 

34. The host cell of claim 33, wherein said cell is a mammalian cell. 
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35. A process for producing a protein, which comprises: 

(a) growing a culture of the host cell of claim 33 in a suitable 
culture medium; and 

(b) purifying the protein from the culture 

36. A protein produced according to the process of claim 35. 

37. A composition comprising a protein, wherein said protein comprises 
an amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:8; and 

(b) fragments of the amino acid sequence of SEQ ID NO:8; 
the protein being substantially free from other mammalian proteins. 

38. The composition of claim 37, further comprising a pharmaceutically 
acceptable carrier. 

39. A composition comprising an antibody which specifically reacts 
with the protein of claim 37. 

40. A method for preventing, treating or ameliorating a medical 
condition which comprises administering to a mammalian subject a therapeutically 
effective amount of a composition of claim 38. 

41. A composition comprising an isolated polynucleotide selected from 
the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ 
ID NO:9 from nucleotide 75 to nucleotide 356; 

(b) a polynucleotide comprising a fragment of the nucleotide 
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(d) a polynucleotide encoding a protein comprising a fragment 
of the amino acid sequence of SEQ ID NO: 10 having biological activity; 

(e) a polynucleotide which is an allelic variant of SEQ ID NO:9; 

and 

(f) a polynucleotide capable of hybridizing under stringent 
conditions to any one of the polynucleotides specified in (a)-(e). 

42. A composition of claim 41 wherein said polynucleotide is operably 
linked to an expression control sequence. 

43. A host cell transformed with a composition of claim 42. 

44. The host cell of claim 43, wherein said cell is a mammalian cell. 

45. A process for producing a protein, which comprises: 

(a) growing a culture of the host cell of claim 43 in a suitable 
culture medium; and 

(b) purifying the protein from the culture 

46. A protein produced according to the process of claim 45, 

47. A composition comprising a protein, wherein said protein comprises 
an amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO: 10; and 

(b) fragments of the amino acid sequence of SEQ ID NO: 10; 
the protein being substantially free from other mammalian proteins. 

48. The composition of claim 47, further comprising a pharmaceutical^ 
acceptable carrier. 



49. A composition comprising an antibody which specifically reacts 
with the protein of claim 47. 
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50. A method for preventing, treating or ameliorating a medical 
condition which comprises administering to a mammalian subject a therapeutically 
effective amount of a composition of claim 48. 

51. A composition comprising an isolated polynucleotide selected from 
the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ 
ID NO: 1 1 from nucleotide 86 to nucleotide 544; 

(b) a polynucleotide comprising a fragment of the nucleotide 
sequence of SEQ ED NO:l 1 encoding a protein having biological activity; 

(c) a polynucleotide encoding a protein comprising the amino 
acid sequence of SEQ ID NO: 12; 

(d) a polynucleotide encoding a protein comprising a fragment 
of the amino acid sequence of SEQ ID NO: 12 having biological activity; 

(e) a polynucleotide which is an allelic variant of SEQ ID 
NO:ll;and 

(f) a polynucleotide capable of hybridizing under stringent 
conditions to any one of the polynucleotides specified in (a)-(e). 

52. A composition of claim 51 wherein said polynucleotide is operably 
linked to an expression control sequence. 

53. A host cell transformed with a composition of claim 52. 

54. The host cell of claim 53, wherein said cell is a mammalian cell. 

55. A process for producing a protein, which comprises: 

(a) growing a culture of the host cell of claim 53 in a suitable 



56. A protein produced according to the process of claim 55. 



WO 97/07198 



PCT7US96/12897 



57. A composition comprising a protein, wherein said protein comprises 
an amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO: 12; and 

(b) fragments of the amino acid sequence of SEQ ID NO: 12; 
the protein being substantially free from other mammalian proteins. 

58. The composition of claim 57, further comprising a pharmaceutically 
acceptable carrier. 

59. A composition comprising an antibody which specifically reacts 
with the protein of claim 57. 

60. A method for preventing, treating or ameliorating a medical 
condition which comprises administering to a mammalian subject a therapeutically 
effective amount of a composition of claim 58. 
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